CLEIC ACID AND CORRESPONDING PROTEIN ENTITLED 101P3A11 
USEFUL IN TREATMENT AND DETECTION OF CANCER 

This application claims benefit of priority from U.S. Provisional Application Serial Number 60/291,118,
filed May 15, 2001 and is a continuation-in-part of U.S. Application Serial Number 09/680,728, filed October 5,
2000, which claims benefit of priority from U.S. Provisional Application Serial Number 60/157,902, filed October
5, 1999. All three of these applications are hereby incorporated in their entireties as if fully set forth.

FIELD OF THE INVENTION 

The invention described herein relates to a novel gene and its encoded protein, termed 101P3A11, and to
diagnostic and therapeutic methods and compositions useful in the management of various cancers that express
101P3A11 (also referred to as PHOR-1).

BACKGROUND OF THE INVENTION 

Cancer is the second leading cause of human death next to coronary disease. Worldwide, millions of
people die from cancer every year. In the United States alone, as reported by the American Cancer Society, cancer
causes the death of well over a half-million people annually, with over 1.2 million new cases diagnosed per year.
While deaths from heart disease have been declining significantly, those resulting from cancer generally are on the
rise. In the early part of the next century, cancer is predicted to become the leading cause of death.

Worldwide, several cancers stand out as the leading killers. In particular, carcinomas of the lung, prostate,
breast, colon, pancreas, and ovary represent the primary causes of cancer death. These and virtually all other
carcinomas share a common lethal feature. With very few exceptions, metastatic disease from a carcinoma is fatal.
Moreover, even for those cancer patients who initially survive their primary cancers, common experience has
shown that their lives are dramatically altered. Many cancer patients experience strong anxieties driven by the
awareness of the potential for recurrence or treatment failure. Many cancer patients experience physical
debilitations following treatment. Furthermore, many cancer patients experience a recurrence.

Worldwide, prostate cancer is the fourth most prevalent cancer in men. In North America and Northern
Europe, it is by far the most common cancer in males and is the second leading cause of cancer death in men. In the
United States alone, well over 30,000 men die annually of this disease - second only to lung cancer. Despite the
magnitude of these figures, there is still no effective treatment for metastatic prostate cancer. Surgical
prostatectomy, radiation therapy, hormone ablation therapy, surgical castration and chemotherapy continue to be the
main treatment modalities. Unfortunately, these treatments are ineffective for many and are often associated with
undesirable consequences.

On the diagnostic front, the lack of a prostate tumor marker that can accurately detect early-stage, localized
tumors remains a significant limitation in the diagnosis and management of this disease. Although the serum
prostate specific antigen (PSA) assay has been a very useful tool, its specificity and general utility are
widely regarded as lacking in several important respects.

Progress in identifying additional specific markers for prostate cancer has been improved by the generation
of prostate cancer xenografts that can recapitulate different stages of the disease in mice. The LAPC (Los Angeles
Prostate Cancer) xenografts are prostate cancer xenografts that have survived passage in severe combined immune
deficient (SCID) mice and have exhibited the capacity to mimic the transition from androgen dependence to
androgen independence (Klein et al., 1997, Nat. Med. 3:402). More recently identified prostate cancer markers
include PCTA-1 (Su et al., 1996, Proc. Natl. Acad. Sci. USA 93: 7252), prostate-specific membrane (PSM) antigen
(Pinto et al., Clin Cancer Res 1996 Sep 2 (9): 1445-51), STEAP (Hubert, et al., Proc Natl Acad Sci USA. 1999
Dec 7; 96(25): 14523-8) and prostate stem cell antigen (PSCA) (Reiter et al., 1998, Proc. Natl. Acad. Sci. USA 95:
1735).

While previously identified markers such as PSA, PSM, PCTA and PSCA have facilitated efforts to
diagnose and treat prostate cancer, there is a need for the identification of additional markers and therapeutic
targets for prostate and related cancers in order to further improve diagnosis and therapy.

Renal cell carcinoma (RCC) accounts for approximately 3 percent of adult malignancies. Once adenomas
reach a diameter of 2 to 3 cm, malignant potential exists. In the adult, the two principal malignant renal tumors are
renal cell adenocarcinoma and transitional cell carcinoma of the renal pelvis or ureter. The incidence of renal cell
adenocarcinoma is estimated at more than 29,000 cases in the United States, and more than 11,600 patients died of
this disease in 1998. Transitional cell carcinoma is less frequent, with an incidence of approximately 500 cases per
year in the United States.

Surgery has been the primary therapy for renal cell adenocarcinoma for many decades. Until recently,
metastatic disease has been refractory to any systemic therapy. With recent developments in systemic therapies,
particularly immunotherapies, metastatic renal cell carcinoma may be approached aggressively in appropriate
patients with a possibility of durable responses. Nevertheless, there is a remaining need for effective therapies for
these patients.

Of all new cases of cancer in the United States, bladder cancer represents approximately 5 percent in men
(fifth most common neoplasm) and 3 percent in women (eighth most common neoplasm). The incidence is
increasing slowly, concurrent with an increasing older population. In 1998, there were an estimated 54,500 cases,
including 39,500 in men and 15,000 in women. The age-adjusted incidence in the United States is 32 per 100,000
for men and 8 per 100,000 in women. The historic male/female ratio of 3:1 may be decreasing, related to smoking
patterns in women. There were an estimated 11,000 deaths from bladder cancer in 1998 (7,800 in men and 3,900 in
women). Bladder cancer incidence and mortality strongly increase with age and will be an increasing problem as
the population becomes more elderly.

Most bladder cancers recur in the bladder. Bladder cancer is managed with a combination of transurethral
resection of the bladder (TUR) and intravesical chemotherapy or immunotherapy. The multifocal and recurrent
nature of bladder cancer points out the limitations of TUR. Most muscle-invasive cancers are not cured by TUR
alone. Radical cystectomy and urinary diversion are the most effective means to eliminate the cancer but carry an
undeniable impact on urinary and sexual function. There continues to be a significant need for treatment modalities
that are beneficial for bladder cancer patients.

An estimated 130,200 cases of colorectal cancer occurred in 2000 in the United States, including 93,800
cases of colon cancer and 36,400 of rectal cancer. Colorectal cancers are the third most common cancers in men
and women. Incidence rates declined significantly during 1992-1996 (-2.1% per year). Research suggests that
these declines have been due to increased screening and polyp removal, preventing progression of polyps to
invasive cancers. There were an estimated 56,300 deaths (47,700 from colon cancer, 8,600 from rectal cancer) in
2000, accounting for about 11% of all U.S. cancer deaths.

At present, surgery is the most common form of therapy for colorectal cancer, and for cancers that have not
spread, it is frequently curative. Chemotherapy, or chemotherapy plus radiation, is given before or after surgery to
most patients whose cancer has deeply perforated the bowel wall or has spread to the lymph nodes. A permanent
colostomy (creation of an abdominal opening for elimination of body wastes) is occasionally needed for colon
cancer and is infrequently required for rectal cancer. There continues to be a need for effective diagnostic and
treatment modalities for colorectal cancer.

There were an estimated 164,100 new cases of lung and bronchial cancer in 2000, accounting for 14% of
all U.S. cancer diagnoses. The incidence rate of lung and bronchial cancer is declining significantly in men, from a
high of 86.5 per 100,000 in 1984 to 70.0 in 1996. In the 1990s, the rate of increase among women began to slow.
In 1996, the incidence rate in women was 42.3 per 100,000.

Lung and bronchial cancer caused an estimated 156,900 deaths in 2000, accounting for 28% of all cancer
deaths. During 1992-1996, mortality from lung cancer declined significantly among men (-1.7% per year) while
rates for women were still significantly increasing (0.9% per year). Since 1987, more women have died each year
of lung cancer than breast cancer, which, for over 40 years, was the major cause of cancer death in women.
Decreasing lung cancer incidence and mortality rates most likely resulted from decreased smoking rates over the
previous 30 years; however, decreasing smoking patterns among women lag behind those of men. Of concern,
although the declines in adult tobacco use have slowed, tobacco use in youth is increasing again.

Treatment options for lung and bronchial cancer are determined by the type and stage of the cancer and
include surgery, radiation therapy, and chemotherapy. For many localized cancers, surgery is usually the treatment
of choice. Because the disease has usually spread by the time it is discovered, radiation therapy and chemotherapy
are often needed in combination with surgery. Chemotherapy alone or combined with radiation is the treatment of
choice for small cell lung cancer; on this regimen, a large percentage of patients experience remission, which in
some cases is long lasting. There is, however, an ongoing need for effective treatment and diagnostic approaches for
lung and bronchial cancers.

An estimated 182,800 new invasive cases of breast cancer were expected to occur among women in the
United States during 2000. Additionally, about 1,400 new cases of breast cancer were expected to be diagnosed in
men in 2000. After increasing about 4% per year in the 1980s, breast cancer incidence rates in women have leveled
off in the 1990s to about 110.6 cases per 100,000.

In the U.S. alone, there were an estimated 41,200 deaths (40,800 women, 400 men) in 2000 due to breast
cancer. Breast cancer ranks second among cancer deaths in women. According to the most recent data, mortality
rates declined significantly during 1992-1996 with the largest decreases in younger women, both white and black.
These decreases were probably the result of earlier detection and improved treatment.

Taking into account the medical circumstances and the patient's preferences, treatment of breast cancer
may involve lumpectomy (local removal of the tumor) and removal of the lymph nodes under the arm; mastectomy
(surgical removal of the breast) and removal of the lymph nodes under the arm; radiation therapy; chemotherapy; or
hormone therapy. Often, two or more methods are used in combination. Numerous studies have shown that, for
early stage disease, long-term survival rates after lumpectomy plus radiotherapy are similar to survival rates after
modified radical mastectomy. Significant advances in reconstruction techniques provide several options for breast
reconstruction after mastectomy. Recently, such reconstruction has been done at the same time as the mastectomy.

Local excision of ductal carcinoma in situ (DCIS) with adequate amounts of surrounding normal breast
tissue may prevent the local recurrence of the DCIS. Radiation to the breast and/or tamoxifen may reduce the
chance of DCIS occurring in the remaining breast tissue. This is important because DCIS, if left untreated, may
develop into invasive breast cancer. Nevertheless, there are serious side effects or sequelae to these treatments.
There is, therefore, a need for efficacious breast cancer treatments.

There were an estimated 23,100 new cases of ovarian cancer in the United States in 2000. It accounts for
4% of all cancers among women and ranks second among gynecologic cancers. During 1992-1996, ovarian cancer
incidence rates were significantly declining. Consequent to ovarian cancer, there were an estimated 14,000 deaths
in 2000. Ovarian cancer causes more deaths than any other cancer of the female reproductive system.

Surgery, radiation therapy, and chemotherapy are treatment options for ovarian cancer. Surgery usually
includes the removal of one or both ovaries, the fallopian tubes (salpingo-oophorectomy), and the uterus
(hysterectomy). In some very early tumors, only the involved ovary will be removed, especially in young women
who wish to have children. In advanced disease, an attempt is made to remove all intra-abdominal disease to
enhance the effect of chemotherapy. There continues to be an important need for effective treatment options for
ovarian cancer.

There were an estimated 28,300 new cases of pancreatic cancer in the United States in 2000. Over the past
20 years, rates of pancreatic cancer have declined in men. Rates among women have remained approximately
constant but may be beginning to decline. Pancreatic cancer caused an estimated 28,200 deaths in 2000 in the
United States. Over the past 20 years, there has been a slight but significant decrease in mortality rates among men
(about -0.9% per year) while rates have increased slightly among women.

Surgery, radiation therapy, and chemotherapy are treatment options for pancreatic cancer. These treatment
options can extend survival and/or relieve symptoms in many patients but are not likely to produce a cure for most.
There is a significant need for additional therapeutic and diagnostic options for pancreatic cancer.

SUMMARY OF THE INVENTION 

The present invention relates to a gene, designated 101P3A11, that is over-expressed in the normal tissues
and cancers listed in Table I. Northern blot expression analysis of 101P3A11 gene expression in normal tissues
shows a restricted expression pattern in adult tissues. The nucleotide (Figure 2) and amino acid (Figure 2, and
Figure 3) sequences of 101P3A11 are provided. The tissue-related profile of 101P3A11 in normal adult tissues,
combined with the over-expression observed in tumors of the tissues listed in Table I, shows that 101P3A11 is
aberrantly over-expressed in certain cancers, and thus serves as a useful diagnostic, prophylactic, prognostic,
and/or therapeutic target for cancers that express the protein, such as those listed in Table I.

The invention provides polynucleotides corresponding or complementary to all or part of the 101P3A11
genes, mRNAs, and/or coding sequences, preferably in isolated form, including polynucleotides encoding
101P3A11-related proteins and fragments of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24,
25, or more than 25 contiguous amino acids; at least 30, 35, 40, 45, 50, 55, 60, 65, 70, 80, 85, 90, 95, 100 or more
than 100 contiguous amino acids of a 101P3A11-related protein, as well as the peptides/proteins themselves; DNA,
RNA, DNA/RNA hybrids, and related molecules, polynucleotides or oligonucleotides complementary or having at
least a 90% homology to the 101P3A11 genes or mRNA sequences or parts thereof, and polynucleotides or
oligonucleotides that hybridize to the 101P3A11 genes, mRNAs, or to 101P3A11-encoding polynucleotides. Also
provided are means for isolating cDNAs and the genes encoding 101P3A11. Recombinant DNA molecules containing
101P3A11 polynucleotides, cells transformed or transduced with such molecules, and host-vector systems for the
expression of 101P3A11 gene products are also provided. The invention further provides antibodies that bind to
101P3A11 proteins and polypeptide fragments thereof, including polyclonal and monoclonal antibodies, murine and
other mammalian antibodies, chimeric antibodies, humanized and fully human antibodies, and antibodies labeled
with a detectable marker. In certain embodiments there is a proviso that the entire nucleic acid sequence of Figure 2
is not encoded and/or the entire amino acid sequence of Figure 2 is not prepared. In certain embodiments, the entire
nucleic acid sequence of Figure 2 is encoded and/or the entire amino acid sequence of Figure 2 is prepared, either of
which are in respective human unit dose forms.

The invention further provides methods for detecting the presence and status of 101P3A11 polynucleotides
and proteins in various biological samples, as well as methods for identifying cells that express 101P3A11. A typical
embodiment of this invention provides methods for monitoring 101P3A11 gene products in a tissue or hematology
sample having or suspected of having some form of growth dysregulation such as cancer.

The invention further provides various immunogenic or therapeutic compositions and strategies for
treating cancers that express 101P3A11 such as cancers of tissues listed in Table I, including therapies aimed at
inhibiting the transcription, translation, processing or function of 101P3A11 as well as cancer vaccines. In one
aspect, the invention provides compositions, and methods comprising them, for treating a cancer that expresses
101P3A11 (PHOR-1) in a human subject wherein the composition comprises a carrier suitable for human use and a
human unit dose of one or more than one agent that inhibits the production or function of 101P3A11 (PHOR-1).
Preferably, the carrier is a uniquely human carrier. In another aspect of the invention, the agent is a moiety that is
immunoreactive with 101P3A11 protein. Non-limiting examples of such moieties include
antibodies (such as single chain, monoclonal, polyclonal, humanized, or human antibodies), functional equivalents
thereof (whether naturally occurring or synthetic), and combinations thereof. In another aspect, the agent is a small
molecule as defined herein.

In another aspect, the agent comprises one or more than one peptide which comprises a cytotoxic T
lymphocyte (CTL) epitope that binds an HLA class I molecule in a human to elicit a CTL response to 101P3A11
and/or one or more than one peptide which comprises a helper T lymphocyte (HTL) epitope which binds an HLA
class II molecule in a human to elicit an HTL response. In a further aspect of the invention, the agent comprises
one or more than one nucleic acid molecule that expresses one or more than one of the CTL or HTL response
stimulating peptides as described above. In yet another aspect of the invention, the one or more than one nucleic
acid molecule may express a moiety that is immunologically reactive with 101P3A11 as described above. The one
or more than one nucleic acid molecule may also be, or encode, a molecule that inhibits production of 101P3A11.
Non-limiting examples of such molecules include those complementary to a nucleotide
sequence essential for production of 101P3A11 (e.g. antisense sequences or molecules that form a triple helix with a
nucleotide double helix essential for 101P3A11 production) or a ribozyme effective to lyse 101P3A11 mRNA.

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. 101P3A11 SSH sequence. The 101P3A11 SSH sequence is shown as SEQ ID NO: 2864.

Figures 2A-2D. The cDNA (SEQ ID NO: 2865) and amino acid sequence (SEQ ID NO: 2866) of
101P3A11. The start methionine is underlined. The open reading frame extends from nucleic acid 133 to 1086
including the stop codon (the codon for the initial M is omitted as the shorter peptide has a more favorable Kozak
sequence).

Figure 3. Amino acid sequence of 101P3A11 (SEQ ID NO: 2866). The 101P3A11 protein has 317
amino acids.
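
As a consistency check on the figures above, the stated open reading frame (nucleic acids 133 to 1086, including the stop codon) can be converted to a protein length. The following is a minimal sketch; the function name `orf_protein_length` is hypothetical and is introduced here only for illustration, and the coordinates are assumed to be 1-based and inclusive.

```python
def orf_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF.

    Coordinates are assumed to be 1-based and inclusive, as in the
    figure description (an assumption; the source does not say so).
    """
    span = end - start + 1          # number of nucleotides in the ORF
    if span % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = span // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 1086 - 133 + 1 = 954 nucleotides = 318 codons, one of which is the stop.
    print(orf_protein_length(133, 1086))
```

Under these assumptions the result is 317, which agrees with the protein length given for Figure 3.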

Figure 4. Alignment of 101P3A11 (Sbjct-SEQ ID NO: 2867) with mouse olfactory receptor S25
(Query-SEQ ID NO: 2868). The transmembrane regions of 101P3A11 and mouse olfactory receptor S25
(ORS25) predicted using the TMHMM algorithm are highlighted in gray. The amino acids of ORS25 predicted
(Floriano, W.B., et al., 2000, Proc. Natl. Acad. Sci., USA, 97:10712-10716) to be involved in binding of the ligand
hexanol and/or involved in the formation of the ligand binding pocket are italicized and bolded in the Figure, and
are: Leu 131, Val 134, Val 135, Gly 138, Thr 139, Ser 193, Ser 197, Phe 225, Ala 230, Ile 231, Gly 234, Thr 284,
Phe 287, Gln 300, Lys 302.

Figure 5. Hydrophilicity amino acid profile of 101P3A11 determined by computer algorithm sequence
analysis using the method of Hopp and Woods (Hopp T.P., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A.
78:3824-3828) accessed on the ProtScale website (www.expasy.ch/cgi-bin/protscale.pl) through the ExPasy
molecular biology server.

Figure 6. Hydropathicity amino acid profile of 101P3A11 determined by computer algorithm sequence
analysis using the method of Kyte and Doolittle (Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132) accessed
on the ProtScale website (www.expasy.ch/cgi-bin/protscale.pl) through the ExPasy molecular biology server.
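
The profiles of Figures 5 through 9 are all sliding-window averages of a per-residue scale. The following is a minimal sketch of that calculation for the Kyte and Doolittle hydropathy scale. It is an illustration under stated assumptions and not the ProtScale implementation: the window size of 9 and the unweighted mean are assumed, since the figure description does not state them, and the example peptide is hypothetical and is not the 101P3A11 sequence.

```python
# Kyte-Doolittle hydropathy index per residue (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 9) -> list[float]:
    """Return the unweighted sliding-window mean hydropathy.

    One value is produced per full window, so the result has
    len(sequence) - window + 1 entries. The window size is an
    assumption; the source does not specify it.
    """
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical peptide for illustration only (not 101P3A11).
    toy_peptide = "MKTLLVAGIVLLSAAQDEKR"
    for position, value in enumerate(hydropathy_profile(toy_peptide), start=1):
        print(position, round(value, 2))
```

Stretches of strongly positive window averages indicate hydrophobic segments, which is the kind of signal the transmembrane predictions described for Figure 19 build on. Swapping in a different per-residue table would give the other profile types under the same assumptions.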

Figure 7. Percent accessible residues amino acid profile of 101P3A11 determined by computer algorithm
sequence analysis using the method of Janin (Janin J., 1979 Nature 277:491-492) accessed on the ProtScale website
(www.expasy.ch/cgi-bin/protscale.pl) through the ExPasy molecular biology server.

Figure 8. Average flexibility amino acid profile of 101P3A11 determined by computer algorithm
sequence analysis using the method of Bhaskaran and Ponnuswamy (Bhaskaran R., and Ponnuswamy P.K., 1988.
Int. J. Pept. Protein Res. 32:242-255) accessed on the ProtScale website (www.expasy.ch/cgi-bin/protscale.pl)
through the ExPasy molecular biology server.

Figure 9. Beta-turn amino acid profile of 101P3A11 determined by computer algorithm sequence analysis
using the method of Deleage and Roux (Deleage, G., Roux B. 1987 Protein Engineering 1:289-294) accessed on
the ProtScale website (www.expasy.ch/cgi-bin/protscale.pl) through the ExPasy molecular biology server.

Figure 10A. Expression of 101P3A11 by RT-PCR. First strand cDNA was prepared from vital pool 1
(VP1: liver, lung and kidney), vital pool 2 (VP2: pancreas, colon and stomach), prostate xenograft pool, prostate
cancer pool, kidney cancer pool, colon cancer pool, breast cancer pool, and cancer metastasis pool. Normalization
was performed by PCR using primers to actin and GAPDH. Semi-quantitative PCR, using primers to 101P3A11,
was performed at 30 cycles of amplification. Expression of 101P3A11 was observed in prostate xenograft pool,
prostate cancer pool, kidney cancer pool, colon cancer pool, breast cancer pool, and cancer metastasis pool, but not
in VP1 and VP2.

Figure 10B. Expression of 101P3A11 in human cancers demonstrated by dot blot analysis of tumor RNA (T)
and normal RNA (N) matched samples using patient-derived amplified cDNAs. Up-regulation of PHOR-1 expression
was found in 3 of 3 prostate cancer patients, 6 of 14 kidney cancer patients, 2 of 8 uterine cancer patients, 3 of 8 stomach
cancer patients and 7 of 7 rectal cancer patients.

Figure 11. Expression of 101P3A11 in human patient cancer specimens. RNA was extracted from a
pool of three prostate cancer tumors, kidney cancer tumors, colon cancer tumors, breast cancer tumors, and a cancer
metastasis pool derived from cancer patients, as well as from normal prostate (NP), normal bladder (NB), normal
kidney (NK) and normal colon (NC). Northern blots with 10 µg of total RNA/lane were probed with a 101P3A11
fragment. Size standards in kilobases (kb) are indicated on the side. The results showed expression of 101P3A11
in prostate cancer tumors, kidney cancer tumors, colon cancer tumors, breast cancer tumors, cancer metastasis pool,
bladder cancer pool, and in the normal prostate but not in the other normal tissues. A picture of the ethidium-bromide
staining of the RNA gel is also presented.

Figure 12A. Expression of 101P3A11 in prostate cancer patient specimens. RNA was extracted from
prostate tumors (T) and their normal adjacent tissues (Nat) derived from prostate cancer patients. Northern blots
with 10 µg of total RNA/lane were probed with 101P3A11 sequences. Results show upregulated expression of
101P3A11 in 8 of 10 tumor specimens.

Figure 12B. Photomicrograph showing 101P3A11 expression in prostatic intraepithelial neoplasia (PIN) by
in situ hybridization with an anti-sense 101P3A11 riboprobe.

Figure 12C. Photomicrograph showing 101P3A11 expression in prostate cancer tissue by in situ
hybridization with an anti-sense 101P3A11 riboprobe.

Figure 12D. Photomicrograph showing 101P3A11 expression in prostate cancer by in situ hybridization with
an anti-sense 101P3A11 riboprobe. Note up-regulation of expression relative to normal prostate (Figure 12E).

Figure 12E. Photomicrograph showing 101P3A11 expression in normal prostate by in situ hybridization with
an anti-sense 101P3A11 riboprobe.

Figure 13. Expression of 101P3A11 in colon cancer patient specimens. RNA was extracted from
colon tumors (T) and their normal adjacent tissues (Nat) derived from colon cancer patients. Northern blots with 10
µg of total RNA/lane were probed with 101P3A11 sequences. Size standards in kilobases (kb) are indicated on the
side. Results showed expression of 101P3A11 in colon tumors but not in normal tissues. Expression was also seen
in the colon cancer cell line T84. A picture of the ethidium-bromide staining of the RNA gel is also presented.

Figure 14. Expression of 101P3A11 in kidney cancer patient specimens. RNA was extracted from
kidney tumors (T) and their normal adjacent tissues (Nat) derived from kidney cancer patients. Northern blots with
10 µg of total RNA/lane were probed with 101P3A11 sequences. Size standards in kilobases (kb) are indicated on
the side. The results showed expression of 101P3A11 in five of six kidney tumor specimens. The expression
detected in normal adjacent tissues (isolated from diseased tissues) but not in normal tissues (isolated from healthy
donors) indicates that these tissues are not fully normal and that 101P3A11 is expressed in early stage tumors. A
picture of the ethidium-bromide staining of the RNA gel is also presented.

Figures 15A-15C. Androgen regulation of 101P3A11 in tissue culture cells. LAPC-9 cells were grown
in charcoal-stripped medium and stimulated with the synthetic androgen mibolerone, for either 14 or 24 hours.
Northern blots with 10 µg of total RNA/lane were probed with 101P3A11 sequences (Figure 15A). A picture of the
ethidium-bromide staining of the RNA gel is also presented (Figure 15C). Results showed expression of 101P3A11
was not regulated by androgen. The experimental samples were confirmed by testing for the expression of the
androgen-regulated prostate cancer gene PSA (Figure 15B). This experiment showed that, as expected, PSA levels
go down in presence of charcoal-stripped serum, and expression is induced at 14 and 24 hours in presence of
mibolerone.

Figure 16. Androgen regulation of 101P3A11 in vivo. Male mice were injected with LAPC-9 AD tumor
cells. When tumors reached a palpable size (0.3-0.5 cm in diameter), mice were castrated and tumors harvested at
different time points following the castration. RNA was isolated from the xenograft tissues. Northern blots with 10
µg of total RNA/lane were probed with 101P3A11 sequences. Size standards in kilobases (kb) are indicated on the
side. A picture of the ethidium-bromide staining of the RNA gel is also presented. The results showed that
expression of 101P3A11 is not androgen regulated.

Figure 17. Expression and detection of 101P3A11(159-202)-psecFc fusion protein. The
101P3A11(159-202)-psecFc vector was constructed. The recombinant expression vector DNA was transfected into
either 293T cells or Cos-7 cells. Cells as well as culture supernatants (media) were harvested 24 hours later. The
cells were lysed, and run on SDS-PAGE gel along with the media samples. The gel was transferred to
nitrocellulose, stained with HRP-labeled anti-human IgG and developed using the ECL chemiluminescence
detection kit. Results showed expression of the 101P3A11(159-202)-psecFc fusion protein in the lysates of both
293T and Cos-7 cells. The 101P3A11(159-202)-psecFc fusion protein was also secreted and detected in the culture
supernatants of both cell types.

Figure 18. Expression of 101P3A11 in 300.19 cells following retroviral-mediated gene delivery.
300.19 cells were transduced with the pSRa retroviral vector encoding the 101P3A11 gene. Following selection
with neomycin, the cells were expanded and RNA was extracted. A Northern blot with 10 µg of total RNA/lane
was probed with the 101P3A11 sequence. Size standards in kilobases (kb) are indicated on the side. Results
showed expression of the 101P3A11 transcript driven from the retroviral LTR. LAPC-4AD and LAPC-9AD
showed expression of the endogenous 101P3A11 transcript. The figure shows results of a short exposure of the
autoradiogram.

Figures 19A-19C. Secondary structure and transmembrane prediction for 101P3A11 (SEQ ID NO: 2866).
Figure 19A: The secondary structure of 101P3A11 protein was predicted using the HNN - Hierarchical Neural
Network method (Guermeur, 1997, http://pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_nn.html), accessed from
the ExPasy molecular biology server (http://www.expasy.ch/tools/). This method predicts the presence and location
of alpha helices, extended strands, and random coils from the primary protein sequence. The percent of the protein
in a given secondary structure is also given. Figure 19B is a schematic representation of the probability of existence
of transmembrane regions and orientation of 101P3A11 based on the TMpred algorithm of Hofmann and Stoffel
which utilizes TMBASE (K. Hofmann, W. Stoffel. TMBASE - A database of membrane spanning protein
segments Biol. Chem. Hoppe-Seyler 374:166, 1993). Figure 19C is a schematic representation of the probability of
the existence of transmembrane regions and the extracellular and intracellular orientation of 101P3A11 based on the
TMHMM algorithm of Sonnhammer, von Heijne, and Krogh (Erik L.L. Sonnhammer, Gunnar von Heijne, and
Anders Krogh: A hidden Markov model for predicting transmembrane helices in protein sequences. In Proc. of
Sixth Int. Conf. on Intelligent Systems for Molecular Biology, p 175-182 Ed J. Glasgow, T. Littlejohn, F. Major, R.
Lathrop, D. Sankoff, and C. Sensen Menlo Park, CA: AAAI Press, 1998). The TMpred and TMHMM algorithms
are accessed from the ExPasy molecular biology server (http://www.expasy.ch/tools/). The results of the
transmembrane prediction programs presented in Figure 19B and 19C depict 101P3A11 as containing 7
transmembrane domains consistent with that of a G-protein coupled receptor.

Figure 20. Expression of 101P3A11 in NIH-3T3 Tumors. Mice were injected subcutaneously with
control 3T3-neo or NIH3T3 cells expressing 101P3A11. Tumors were allowed to grow, the mice were then
sacrificed and tumors harvested. RNA was isolated from LAPC-4AD and LAPC-4AI xenografts, 3T3-neo and
3T3-101P3A11 cells grown in culture were used as controls. RNA isolated from six different tumors derived from
3T3-101P3A11 cells (Tumor #1-3) were compared by Northern blotting. Northern blots with 10 µg of total
RNA/lane were probed with 101P3A11 sequence. A picture of the ethidium-bromide staining of the RNA gel is
also presented. Results showed expression of 101P3A11 in all 3T3-101P3A11 tumors as well as in 3T3/101P3A11
cells used to derive the tumors, but not in the negative control 3T3/neo cells.
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Figure 21. 101P3A1 1 Induces Tumor Formation of 3T3 Cells. Injection of 106 3T3-neo, 3T3-Ras or 
3T3-101P3A1 1 cells (106 of the indicated cells mixed with Matrigel) subcutaneously into 6 male SCID mice (right 
flank) revealed that 6/6 3T3-vl2Ras-injected mice formed tumors, 6/6 3T3-101P3A1 1- injected mice formed 
tumors, and 0/6 3T3-neo-injected mice formed tumors. Each data point represents the mean tumor volume (n = 6) 
5 in each group. 

Figure 22. PTX reduces the in vivo growth of 3T3-101P3A11 Tumors. Pertussis toxin was found to
inhibit the subcutaneous growth of 3T3-101P3A11 tumors in SCID mice in a dose-dependent manner.

Figure 23. Alignment of 101P3A11-PHOR-1 (Phor- SEQ ID NO: 2869) with the rat GPCR RA1C -
SEQ ID NO: 2870 (gi|3420759). Identities = 179/299 (59%), Positives = 231/299 (76%), Gaps = 1/299 (0%).

Figure 24. Alignment of 101P3A11-PHOR-1 (Phor- SEQ ID NO: 2871) with the human prostate
specific GPCR- SEQ ID NO: 2872 (gi|13540539). Identities = 179/299 (59%), Positives = 233/299 (77%), Gaps =
1/299 (0%).

Figure 25. Alignment of 101P3A11-PHOR-1 (Phor- SEQ ID NO: 2873) with human olfactory receptor
51112, HOR5- SEQ ID NO: 2874, (gi|14423836). Identities = 163/304 (53%), Positives = 214/304 (69%), Gaps =
1/304 (0%).

Figure 26. 101P3A11 Modulated Tyrosine Phosphorylation in NIH-3T3 Cells. 101P3A11 mediated the
de-phosphorylation of proteins at 200, 120-140, 85-90 and 55 kDa. 101P3A11 induced the phosphorylation of
proteins at 80 and 29 kDa in NIH-3T3 cells.

Figure 27. ERK Phosphorylation by GPCR ligands in 101P3A11 Expressing Cells. FBS, lysophosphatidic
acid, gastrin releasing peptide, leukotriene and platelet activating factor induced the phosphorylation of ERK in
101P3A11 expressing cells.

Figure 28. Inhibition of 101P3A11-Mediated ERK Activation by PD98059. ERK phosphorylation was
inhibited by a MEK-specific (PD98059) but not a p38-specific (SB203580) inhibitor in PC3-101P3A11 cells.

Figure 29. Enhanced ERK Phosphorylation in Sodium Orthovanadate Treated PC3-101P3A11 Cells.
Sodium orthovanadate induced increased ERK phosphorylation in PC3-101P3A11 cells relative to PC3-neo cells.

Figure 30. Inhibition of 101P3A11-Mediated ERK Phosphorylation by AG1517. The EGFR inhibitor,
AG1517, inhibits EGF-mediated ERK phosphorylation in control and 101P3A11-expressing PC3 cells. AG1517
partially inhibits 101P3A11-mediated ERK phosphorylation in PC3 cells.

Figures 31A-31B. Activation of p38 in PC3-101P3A11 Cells. Expression of 101P3A11 mediates p38
phosphorylation in cells treated with 10% FBS as shown by blotting with antibodies to phospho-p38 (Figure 31A)
compared to p38 (Figure 31B).

Figure 32. 101P3A11 Induced Accumulation of cAMP in PC3 Cells. Expression of 101P3A11 increased
the accumulation of cAMP in cells treated with 0.1% and 10% FBS. FBS-induced cAMP accumulation in
101P3A11 cells was inhibited by pertussis toxin.

Figure 33. Pertussis Toxin Inhibits 101P3A11 Mediated ERK Phosphorylation. Pertussis toxin inhibited
FBS-mediated ERK phosphorylation in 101P3A11 expressing cells.

Figure 34. Pertussis Toxin Inhibited ERK Phosphorylation in PC3-101P3A11 Cells. Pertussis toxin
inhibited FBS-mediated ERK phosphorylation in 101P3A11 expressing cells. The inhibitory activity of pertussis
toxin on ERK phosphorylation was more dramatic in FBS-treated than EGF or GRP-treated PC3-101P3A11 cells.

Figure 35. Inhibition of 101P3A11-mediated signaling by Suramin, a G protein inhibitor. Control NIH
3T3 and 3T3-101P3A11 cells were grown in the presence or absence of G protein inhibitors Suramin and NF449.
Proliferation was analyzed by Alamar blue after 72 hours. Suramin and NF449 inhibited the proliferation of
101P3A11 expressing but not control cells.

Figures 36A-36B. 101P3A11 Mediated ERK Phosphorylation By Conditioned Media. Figure 36A:
blotting with anti-phospho ERK antibodies; Figure 36B: blotting with anti-ERK antibodies. Supernatants from
PC3, PC3-101P3A11, PrEC and LAPC42 cells induce ERK phosphorylation in PC3-101P3A11 but not PC3 cells.
Supernatants from 3T3 and 293T cells had little specific effect on ERK phosphorylation.

Figure 37. 101P3A11 Enhances the Proliferation of 3T3 Cells. Control NIH 3T3 and 3T3-101P3A11
cells were grown in the presence or absence of 0.5 or 10% FBS. Proliferation was analyzed by Alamar blue after 48
hours. Expression of 101P3A11 induced a 6-fold increase in the proliferation of 3T3 cells grown in 0.5% FBS.

Figure 38. Inhibition of 101P3A11 Mediated ERK Phosphorylation by 101P3A11 Specific Antibodies.
Expression of 101P3A11 induced ERK phosphorylation in 293T cells. Anti-101P3A11 pAb inhibited ERK
phosphorylation in 293T-101P3A11 cells.

Figure 39. Anti-101P3A11 Ab Mediated cAMP Accumulation in PC3-101P3A11 Cells. Control PC3
cells and cells expressing 101P3A11 were treated with anti-101P3A11 pAb for 2 min and evaluated for
intracellular cAMP content. The assay was performed in duplicate.

Figures 40A-40F. Photomicrographs showing immunohistochemical analysis using anti-101P3A11 (peptide
1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin embedded prostate cancer tissues (Figure
40A); anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin
embedded prostate cancer cell line, LNCaP (Figure 40B); anti-101P3A11 (peptide 1; amino acids 1-14) rabbit
polyclonal antibody on formalin fixed and paraffin embedded prostate cancer tissues (Figure 40C); anti-101P3A11
(peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin embedded normal prostate
(Figure 40D); anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin
embedded prostate cancer tissues (Figure 40E); and anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal
antibody on formalin fixed and paraffin embedded normal prostate (Figure 40F).

Figures 41A-41F. Photomicrographs showing immunohistochemical analysis using anti-101P3A11 (peptide
1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin embedded prostate cancer tissues (Figure
41A); anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin
embedded bladder cancer tissues (Figure 41B); anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody
on formalin fixed and paraffin embedded kidney cancer tissues (Figure 41C); anti-101P3A11 (peptide 1; amino acids
1-14) rabbit polyclonal antibody on formalin fixed and paraffin embedded colon cancer tissues (Figure 41D);
anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin fixed and paraffin embedded lung
cancer tissues (Figure 41E); and anti-101P3A11 (peptide 1; amino acids 1-14) rabbit polyclonal antibody on formalin
fixed and paraffin embedded breast cancer tissues (Figure 41F).

Figure 42 demonstrates that 101P3A11 induces orthotopic growth of tumors. 5x10⁵ cells were
injected orthotopically into SCID mice, 7 mice per group; tumor weight was evaluated 24-25 days post cell
injection.

Figure 43 shows that 101P3A11 induces colony formation in a soft agar assay.


DETAILED DESCRIPTION OF THE INVENTION 
Outline of Sections

I.) Definitions
II.A.3.) Primers and Primer Pairs
II.A.4.) Isolation of 101P3A11-Encoding Nucleic Acid Molecules
II.A.5.) Recombinant Nucleic Acid Molecules and Host-Vector Systems
III.) 101P3A11-related Proteins
III.A.) Motif-bearing Protein Embodiments
III.B.) Expression of 101P3A11-related Proteins
III.C.) Modifications of 101P3A11-related Proteins
III.D.) Uses of 101P3A11-related Proteins
IV.) 101P3A11 Antibodies
V.) 101P3A11 Cellular Immune Responses
VI.) 101P3A11 Transgenic Animals
VII.) Methods for the Detection of 101P3A11
VIII.) Methods for Monitoring the Status of 101P3A11-related Genes and Their Products
IX.) Identification of Molecules That Interact With 101P3A11
X.) Therapeutic Methods and Compositions
X.A.) Anti-Cancer Vaccines
X.B.) 101P3A11 as a Target for Antibody-Based Therapy
X.C.) 101P3A11 as a Target for Cellular Immune Responses
X.C.1. Minigene Vaccines
X.C.2. Combinations of CTL Peptides with Helper Peptides
X.C.3. Combinations of CTL Peptides with T Cell Priming Agents
X.C.4. Vaccine Compositions Comprising DC Pulsed with CTL and/or HTL Peptides
X.D.) Adoptive Immunotherapy
X.E.) Administration of Vaccines for Therapeutic or Prophylactic Purposes
XI.) Diagnostic and Prognostic Embodiments of 101P3A11.
XII.) Inhibition of 101P3A11 Protein Function
XII.A.) Inhibition of 101P3A11 With Intracellular Antibodies
XII.B.) Inhibition of 101P3A11 with Recombinant Proteins
XII.C.) Inhibition of 101P3A11 Transcription or Translation
XII.D.) General Considerations for Therapeutic Strategies
XIII.) KITS

I.) Definitions: 

Unless otherwise defined, all terms of art, notations and other scientific terms or terminology used herein
are intended to have the meanings commonly understood by those of skill in the art to which this invention pertains.
In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference,
and the inclusion of such definitions herein should not necessarily be construed to represent a substantial difference
over what is generally understood in the art. Many of the techniques and procedures described or referenced herein
are well understood and commonly employed using conventional methodology by those skilled in the art, such as,
for example, the widely utilized molecular cloning methodologies described in Sambrook et al., Molecular Cloning:
A Laboratory Manual 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. As
appropriate, procedures involving the use of commercially available kits and reagents are generally carried out in
accordance with manufacturer defined protocols and/or parameters unless otherwise noted.

The terms "advanced prostate cancer", "locally advanced prostate cancer", "advanced disease" and
"locally advanced disease" mean prostate cancers that have extended through the prostate capsule, and are meant to
include stage C disease under the American Urological Association (AUA) system, stage C1 - C2 disease under the
Whitmore-Jewett system, and stage T3 - T4 and N+ disease under the TNM (tumor, node, metastasis) system. In
general, surgery is not recommended for patients with locally advanced disease, and these patients have
substantially less favorable outcomes compared to patients having clinically localized (organ-confined) prostate
cancer. Locally advanced disease is clinically identified by palpable evidence of induration beyond the lateral
border of the prostate, or asymmetry or induration above the prostate base. Locally advanced prostate cancer is
presently diagnosed pathologically following radical prostatectomy if the tumor invades or penetrates the prostatic
capsule, extends into the surgical margin, or invades the seminal vesicles.

"Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more
carbohydrate moieties found in native sequence 101P3A11 (either by removing the underlying glycosylation site or
by deleting the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites
that are not present in the native sequence 101P3A11. In addition, the phrase includes qualitative changes in the
glycosylation of the native proteins, involving a change in the nature and proportions of the various carbohydrate
moieties present.

The term "analog" refers to a molecule which is structurally similar or shares similar or corresponding
attributes with another molecule (e.g. a 101P3A11-related protein). For example, an analog of the 101P3A11 protein can
be specifically bound by an antibody or T cell that specifically binds to 101P3A11.

The term "antibody" is used in the broadest sense. Therefore an "antibody" can be naturally occurring or
man-made such as monoclonal antibodies produced by conventional hybridoma technology. Anti-101P3A11 antibodies
comprise monoclonal and polyclonal antibodies as well as fragments containing the antigen-binding domain and/or one
or more complementarity determining regions of these antibodies.

An "antibody fragment" is defined as at least a portion of the variable region of the immunoglobulin
molecule that binds to its target, i.e., the antigen-binding region. In one embodiment it specifically covers single
anti-101P3A11 antibodies and clones thereof (including agonist, antagonist and neutralizing antibodies) and anti-101P3A11
antibody compositions with polyepitopic specificity.

The term "codon optimized sequences" refers to nucleotide sequences that have been optimized for a
particular host species by replacing any codons having a usage frequency of less than about 20%. Nucleotide
sequences that have been optimized for expression in a given host species by elimination of spurious
polyadenylation sequences, elimination of exon/intron splicing signals, elimination of transposon-like repeats
and/or optimization of GC content in addition to codon optimization are referred to herein as "expression
enhanced sequences."
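
As a rough illustration of the codon replacement described above, the following Python sketch swaps any codon whose usage frequency falls below a threshold (20% here) for the most frequently used synonymous codon. The usage table is a small hypothetical example covering only two amino acids, and treating "usage frequency" as the fraction among synonymous codons is an assumption; neither is taken from this specification.

```python
# Hypothetical codon usage fractions (per amino acid) for an imaginary host.
# These numbers are illustrative assumptions, not measured values.
CODON_USAGE = {
    "L": {"CTG": 0.40, "CTC": 0.20, "CTT": 0.13, "TTG": 0.13, "CTA": 0.07, "TTA": 0.07},
    "K": {"AAG": 0.57, "AAA": 0.43},
}

# Reverse lookup: codon -> amino acid
CODON_TO_AA = {codon: aa for aa, table in CODON_USAGE.items() for codon in table}


def codon_optimize(seq, threshold=0.20):
    """Replace codons used less often than `threshold` with the most frequent synonym."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3].upper()
        aa = CODON_TO_AA.get(codon)
        if aa is None:
            out.append(codon)  # codon not in the toy table: leave unchanged
            continue
        usage = CODON_USAGE[aa]
        if usage[codon] < threshold:
            codon = max(usage, key=usage.get)  # most frequently used synonym
        out.append(codon)
    return "".join(out)
```

With this toy table, a rare leucine codon such as TTA would be replaced by CTG, while codons at or above the threshold are left unchanged.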

The term "cytotoxic agent" refers to a substance that inhibits or prevents the function of cells and/or causes
destruction of cells. The term is intended to include radioactive isotopes, chemotherapeutic agents, and toxins such
as small molecule toxins or enzymatically active toxins of bacterial, fungal, plant or animal origin, including
fragments and/or variants thereof. Examples of cytotoxic agents include, but are not limited to maytansinoids,
yttrium, bismuth, ricin, ricin A-chain, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide,
teniposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin, diphtheria toxin,
Pseudomonas exotoxin (PE) A, PE40, abrin, abrin A chain, modeccin A chain, alpha-sarcin, gelonin, mitogellin,
restrictocin, phenomycin, enomycin, curcin, crotin, calicheamicin, Saponaria officinalis inhibitor, and
glucocorticoid and other chemotherapeutic agents, as well as radioisotopes such as At²¹¹, I¹³¹, I¹²⁵, Y⁹⁰, Re¹⁸⁶, Re¹⁸⁸,
Sm¹⁵³, Bi²¹², P³² and radioactive isotopes of Lu. Antibodies may also be conjugated to an anti-cancer pro-drug
activating enzyme capable of converting the pro-drug to its active form.

The term "homolog" refers to a molecule which exhibits homology to another molecule, by for example,
having sequences of chemical residues that are the same or similar at corresponding positions.

"Human Leukocyte Antigen" or "HLA" is a human class I or class II Major Histocompatibility Complex
(MHC) protein (see, e.g., Stites, et al., IMMUNOLOGY, 8th Ed., Lange Publishing, Los Altos, CA (1994)).

The terms "hybridize", "hybridizing", "hybridizes" and the like, used in the context of polynucleotides, are
meant to refer to conventional hybridization conditions, preferably such as hybridization in 50%
formamide/6XSSC/0.1% SDS/100 µg/ml ssDNA, in which temperatures for hybridization are above 37 degrees C
and temperatures for washing in 0.1XSSC/0.1% SDS are above 55 degrees C.

The phrases "isolated" or "biologically pure" refer to material which is substantially or essentially free
from components which normally accompany the material as it is found in its native state. Thus, isolated peptides
in accordance with the invention preferably do not contain materials normally associated with the peptides in their
in situ environment. For example, a polynucleotide is said to be "isolated" when it is substantially separated from
contaminant polynucleotides that correspond or are complementary to genes other than the 101P3A11 gene or that
encode polypeptides other than 101P3A11 gene product or fragments thereof. A skilled artisan can readily employ
nucleic acid isolation procedures to obtain an isolated 101P3A11 polynucleotide. A protein is said to be "isolated," for
example, when physical, mechanical or chemical methods are employed to remove the 101P3A11 protein from cellular
constituents that are normally associated with the protein. A skilled artisan can readily employ standard purification
methods to obtain an isolated 101P3A11 protein. Alternatively, an isolated protein can be prepared by chemical means.

The term "mammal" refers to any organism classified as a mammal, including mice, rats, rabbits, dogs, cats,
cows, horses and humans. In one embodiment of the invention, the mammal is a mouse. In another embodiment of the
invention, the mammal is a human.

The terms "metastatic prostate cancer" and "metastatic disease" mean prostate cancers that have spread to
regional lymph nodes or to distant sites, and are meant to include stage D disease under the AUA system and stage
TxNxM+ under the TNM system. As is the case with locally advanced prostate cancer, surgery is generally not
indicated for patients with metastatic disease, and hormonal (androgen ablation) therapy is a preferred treatment
modality. Patients with metastatic prostate cancer eventually develop an androgen-refractory state within 12 to 18
months of treatment initiation. Approximately half of these androgen-refractory patients die within 6 months after
developing that status. The most common site for prostate cancer metastasis is bone. Prostate cancer bone
metastases are often osteoblastic rather than osteolytic (i.e., resulting in net bone formation). Bone metastases are
found most frequently in the spine, followed by the femur, pelvis, rib cage, skull and humerus. Other common sites
for metastasis include lymph nodes, lung, liver and brain. Metastatic prostate cancer is typically diagnosed by open
or laparoscopic pelvic lymphadenectomy, whole body radionuclide scans, skeletal radiography, and/or bone lesion
biopsy.

The term "monoclonal antibody" refers to an antibody obtained from a population of substantially
homogeneous antibodies, i.e., the antibodies comprising the population are identical except for possible naturally
occurring mutations that are present in minor amounts.

A "motif", as in biological motif of an 101P3A11-related protein, refers to any pattern of amino acids
forming part of the primary sequence of a protein, that is associated with a particular function (e.g. protein-protein
interaction, protein-DNA interaction, etc.) or modification (e.g. that is phosphorylated, glycosylated or amidated), or
localization (e.g. secretory sequence, nuclear localization sequence, etc.) or a sequence that is correlated with being
immunogenic, either humorally or cellularly. A motif can be either contiguous or capable of being aligned to
certain positions that are generally correlated with a certain function or property. In the context of HLA motifs,
"motif" refers to the pattern of residues in a peptide of defined length, usually a peptide of from about 8 to about 13
amino acids for a class I HLA motif and from about 6 to about 25 amino acids for a class II HLA motif, which is
recognized by a particular HLA molecule. Peptide motifs for HLA binding are typically different for each protein
encoded by each human HLA allele and differ in the pattern of the primary and secondary anchor residues.

A "pharmaceutical excipient" comprises a material such as an adjuvant, a carrier, pH-adjusting and
buffering agents, tonicity adjusting agents, wetting agents, preservative, and the like.

"Pharmaceutically acceptable" refers to a composition that is non-toxic, inert, and/or physiologically
compatible with humans or other mammals.

The term "polynucleotide" means a polymeric form of nucleotides of at least 10 bases or base pairs in
length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide, and is meant to
include single and double stranded forms of DNA and/or RNA. In the art, this term is often used interchangeably
with "oligonucleotide". A polynucleotide can comprise a nucleotide sequence disclosed herein wherein thymidine
(T) (as shown for example in SEQ ID NO: 702) can also be uracil (U); this definition pertains to the differences
between the chemical structures of DNA and RNA, in particular the observation that one of the four major bases in
RNA is uracil (U) instead of thymidine (T).

The term "polypeptide" means a polymer of at least about 4, 5, 6, 7, or 8 amino acids. Throughout the
specification, standard three letter or single letter designations for amino acids are used. In the art, this term is often
used interchangeably with "peptide" or "protein".

An HLA "primary anchor residue" is an amino acid at a specific position along a peptide sequence which
is understood to provide a contact point between the immunogenic peptide and the HLA molecule. One to three,
usually two, primary anchor residues within a peptide of defined length generally define a "motif" for an
immunogenic peptide. These residues are understood to fit in close contact with the peptide binding groove of an HLA
molecule, with their side chains buried in specific pockets of the binding groove. In one embodiment, for example,
the primary anchor residues for an HLA class I molecule are located at position 2 (from the amino terminal
position) and at the carboxyl terminal position of an 8, 9, 10, 11, or 12 residue peptide epitope in accordance with the
invention. In another embodiment, for example, the primary anchor residues of a peptide that will bind an HLA
class II molecule are spaced relative to each other, rather than to the termini of a peptide, where the peptide is
generally of at least 9 amino acids in length. The primary anchor positions for each motif and supermotif are set
forth in Table IV. For example, analog peptides can be created by altering the presence or absence of particular
residues in the primary and/or secondary anchor positions shown in Table IV. Such analogs are used to modulate
the binding affinity and/or population coverage of a peptide comprising a particular HLA motif or supermotif.
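
The class I anchor positions described above (position 2 and the carboxyl terminal position of an 8 to 12 residue peptide) can be sketched in Python as follows. The function names and the example residue sets passed to `matches_motif` are hypothetical illustrations; the actual anchor preferences are those set forth in Table IV, which is not reproduced here.

```python
def class1_anchor_candidates(protein, lengths=(8, 9, 10, 11, 12)):
    """Yield (1-based start, peptide, position-2 residue, C-terminal residue)
    for every peptide window of the given lengths."""
    for n in lengths:
        for start in range(len(protein) - n + 1):
            pep = protein[start:start + n]
            yield start + 1, pep, pep[1], pep[-1]


def matches_motif(peptide, pos2_residues, cterm_residues):
    """True if position 2 and the C-terminal residue both fall in the given sets.

    The residue sets are supplied by the caller; any values used in examples
    are assumptions, not the anchor residues of Table IV.
    """
    return peptide[1] in pos2_residues and peptide[-1] in cterm_residues
```

For instance, under a hypothetical motif that accepts L or M at position 2 and V or L at the C-terminus, `matches_motif("ALLKTVLAV", "LM", "VL")` would be true, while a peptide with alanine at position 2 would be rejected.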

A "recombinant" DNA or RNA molecule is a DNA or RNA molecule that has been subjected to molecular 
manipulation in vitro. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In
general, longer probes require higher temperatures for proper annealing, while shorter probes need lower
temperatures. Hybridization generally depends on the ability of denatured nucleic acid sequences to reanneal when
complementary strands are present in an environment below their melting temperature. The higher the degree of
desired homology between the probe and hybridizable sequence, the higher the relative temperature that can be
used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more
stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization
reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995).

"Stringent conditions" or "high stringency conditions", as defined herein, are identified by, but not limited
to, those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1%
Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75
mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50
mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm
DNA (50 µg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash consisting of 0.1 x SSC
containing EDTA at 55°C. "Moderately stringent conditions" are described by, but not limited to, those in
Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and
include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less
stringent than those described above. An example of moderately stringent conditions is overnight incubation at
37°C in a solution comprising: 20% formamide, 5 x SSC (1 x SSC = 150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/mL denatured sheared salmon sperm
DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.
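
The SSC concentrations quoted above scale linearly with the fold value, which the following Python sketch makes explicit. It assumes 1 x SSC is 0.15 M NaCl and 0.015 M sodium citrate, as implied by the "5 x SSC (0.75 M NaCl, 0.075 M sodium citrate)" figure in the text; the function names and the dictionary layout of a wash condition are hypothetical.

```python
# 1 x SSC, inferred from "5 x SSC (0.75 M NaCl, 0.075 M sodium citrate)" in the text.
NACL_M_PER_X = 0.15
CITRATE_M_PER_X = 0.015


def ssc_molarity(fold):
    """Return (NaCl, sodium citrate) molar concentrations for `fold` x SSC."""
    return fold * NACL_M_PER_X, fold * CITRATE_M_PER_X


def rank_by_salt(washes):
    """Order wash conditions from lowest to highest salt.

    At equal temperature, a lower salt wash is the more stringent one.
    Each wash is a dict with an "ssc_fold" key (a hypothetical layout).
    """
    return sorted(washes, key=lambda w: ssc_molarity(w["ssc_fold"])[0])
```

Under this assumption, the 0.015 M sodium chloride/0.0015 M sodium citrate wash in item (1) corresponds to 0.1 x SSC, the same salt level as the final high-stringency wash in item (3).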

An HLA "supermotif" is a peptide binding specificity shared by HLA molecules encoded by two or more
HLA alleles.

As used herein "to treat" or "therapeutic" and grammatically related terms, refer to any improvement of
any consequence of disease, such as prolonged survival, less morbidity, and/or a lessening of side effects which are
the byproducts of an alternative therapeutic modality; full eradication of disease is not required.

A "transgenic animal" (e.g., a mouse or rat) is an animal having cells that contain a transgene, which 
transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A 
"transgene" is a DNA that is integrated into the genome of a cell from which a transgenic animal develops. 

As used herein, an HLA or cellular immune response "vaccine" is a composition that contains or encodes
one or more peptides of the invention. There are numerous embodiments of such vaccines, such as a cocktail of one
or more individual peptides; one or more peptides of the invention comprised by a polyepitopic peptide; or nucleic
acids that encode such individual peptides or polypeptides, e.g., a minigene that encodes a polyepitopic peptide.
The "one or more peptides" can include any whole unit integer from 1-150 or more, e.g., at least 2, 3, 4, 5, 6, 7, 8, 9,
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40,
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140,
145, or 150 or more peptides of the invention. The peptides or polypeptides can optionally be modified, such as by
lipidation, addition of targeting or other sequences. HLA class I peptides of the invention can be admixed with, or
linked to, HLA class II peptides, to facilitate activation of both cytotoxic T lymphocytes and helper T lymphocytes.
HLA vaccines can also comprise peptide-pulsed antigen presenting cells, e.g., dendritic cells.

The term "variant" refers to a molecule that exhibits a variation from a described type or norm, such as a
protein that has one or more different amino acid residues in the corresponding position(s) of a specifically described
protein (e.g. the 101P3A11 protein shown in Figure 2 or Figure 3). An analog is an example of a variant protein.

The 101P3A11-related proteins of the invention include those specifically identified herein, as well as allelic
variants, conservative substitution variants, analogs and homologs that can be isolated/generated and characterized
without undue experimentation following the methods outlined herein or readily available in the art. Fusion proteins
that combine parts of different 101P3A11 proteins or fragments thereof, as well as fusion proteins of a 101P3A11
protein and a heterologous polypeptide are also included. Such 101P3A11 proteins are collectively referred to as the
101P3A11-related proteins, the proteins of the invention, or 101P3A11. The term "101P3A11-related protein" refers to
a polypeptide fragment or an 101P3A11 protein sequence of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 23, 24, 25, or more than 25 amino acids; or, at least 30, 35, 40, 45, 50, 55, 60, 65, 70, 80, 85, 90, 95, 100 or more
than 100 amino acids.

II.) 101P3A11 Polynucleotides

One aspect of the invention provides polynucleotides corresponding or complementary to all or part of an
101P3A11 gene, mRNA, and/or coding sequence, preferably in isolated form, including polynucleotides encoding an
101P3A11-related protein and fragments thereof, DNA, RNA, DNA/RNA hybrid, and related molecules,
polynucleotides or oligonucleotides complementary to an 101P3A11 gene or mRNA sequence or a part thereof, and
polynucleotides or oligonucleotides that hybridize to an 101P3A11 gene, mRNA, or to an 101P3A11 encoding
polynucleotide (collectively, "101P3A11 polynucleotides"). In all instances when referred to in this section, T can
also be U in Figure 2.
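
The two sequence relationships used in the paragraph above, T read as U and a strand that is complementary to another, can be sketched in Python as follows. The function names are hypothetical, and any short sequence passed to them is an arbitrary illustration rather than part of the 101P3A11 sequence of Figure 2.

```python
# Translation table for Watson-Crick base pairing on DNA (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def to_rna(dna):
    """Return the RNA form of a DNA sequence (every T replaced by U)."""
    return dna.replace("T", "U").replace("t", "u")


def reverse_complement(dna):
    """Return the fully complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]
```

For the arbitrary sequence ATGGCT, `to_rna` gives AUGGCU and `reverse_complement` gives AGCCAT.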

Embodiments of a 101P3A11 polynucleotide include: a 101P3A11 polynucleotide having the sequence
shown in Figure 2; the nucleotide sequence of 101P3A11 as shown in Figure 2, wherein T is U; at least 10
contiguous nucleotides of a polynucleotide having the sequence as shown in Figure 2; or, at least 10 contiguous
nucleotides of a polynucleotide having the sequence as shown in Figure 2 where T is U. For example, embodiments
of 101P3A11 nucleotides comprise, without limitation:

(a) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
35 Figure 2 (SEQ ID NO.: 2865), wherein T can also be U; 

(b) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
Figure 2 (SEQ ID NO.: 2865), from nucleotide residue number 133 through nucleotide residue number 
1086, wherein T can also be U; 
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(c) a polynucleotide that encodes a 101P3A1 1 -related protein whose sequence is encoded by the 
cDNAs contained in the plasmid designated pl01P3Al 1, clone GTH10 deposited with American Type 
Culture Collection as Accession No. PTA-312 on July 2, 1999; 

(d) a polynucleotide that encodes a 101P3A11-related protein that is at least 90% homologous to the
entire amino acid sequence shown in Figure 2 (SEQ ID NO.: 2866);

(e) a polynucleotide that encodes a 101P3A11-related protein that is at least 90% identical to the
entire amino acid sequence shown in Figure 2 (SEQ ID NO.: 2866);

(f) a polynucleotide that encodes at least one peptide set forth in Tables V-XVIII or XXVI-XXVIII;

(g) a polynucleotide that encodes a peptide region of at least 5 amino acids of Figure 3 in any whole 
number increment up to 317 that includes an amino acid position having a value greater than 0.5 in the 
Hydrophilicity profile of Figure 5; 

(h) a polynucleotide that encodes a peptide region of at least 5 amino acids of Figure 3 in any whole
number increment up to 317 that includes an amino acid position having a value less than 0.5 in the
Hydropathicity profile of Figure 6;

(i) a polynucleotide that encodes a peptide region of at least 5 amino acids of Figure 3 in any whole 
number increment up to 317 that includes an amino acid position having a value greater than 0.5 in the 
Percent Accessible Residues profile of Figure 7; 

(j) a polynucleotide that encodes a peptide region of at least 5 amino acids of Figure 3 in any whole
number increment up to 317 that includes an amino acid position having a value greater than 0.5 in the
Average Flexibility profile of Figure 8;

(k) a polynucleotide that encodes a peptide region of at least 5 amino acids of Figure 3 in any whole
number increment up to 317 that includes an amino acid position having a value greater than 0.5 in the
Beta-turn profile of Figure 9;

(l) a polynucleotide that is fully complementary to a polynucleotide of any one of (a)-(k);

(m) a polynucleotide that selectively hybridizes under stringent conditions to a polynucleotide of (a)-(i);

(n) a peptide that is encoded by any of (a)-(k); and,

(o) a polynucleotide of any of (a)-(m) or peptide of (n) together with a pharmaceutical excipient
and/or in a human unit dose form.

As used herein, a range is understood to specifically disclose all whole unit positions thereof.

Typical embodiments of the invention disclosed herein include 101P3A11 polynucleotides that encode
specific portions of the 101P3A11 mRNA sequence (and those which are complementary to such sequences) such
as those that encode the protein and fragments thereof, for example of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125,
130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 250, 275, 300,
or 317 contiguous amino acids.

For example, representative embodiments of the invention disclosed herein include: polynucleotides, and
their encoded peptides themselves, encoding about amino acid 1 to about amino acid 10 of the 101P3A11 protein
shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 10 to about amino acid 20 of the
101P3A11 protein shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 20 to about amino
acid 30 of the 101P3A11 protein shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 30 to
about amino acid 40 of the 101P3A11 protein shown in Figure 2 or Figure 3, polynucleotides encoding about amino
acid 40 to about amino acid 50 of the 101P3A11 protein shown in Figure 2 or Figure 3, polynucleotides encoding
about amino acid 50 to about amino acid 60 of the 101P3A11 protein shown in Figure 2 or Figure 3,
polynucleotides encoding about amino acid 60 to about amino acid 70 of the 101P3A11 protein shown in Figure 2
or Figure 3, polynucleotides encoding about amino acid 70 to about amino acid 80 of the 101P3A11 protein shown
in Figure 2 or Figure 3, polynucleotides encoding about amino acid 80 to about amino acid 90 of the 101P3A11
protein shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 90 to about amino acid 100 of
the 101P3A11 protein shown in Figure 2 or Figure 3, in increments of about 10 amino acids, ending at the carboxyl
terminal amino acid set forth in Figure 2 or Figure 3. Accordingly, polynucleotides encoding portions of the amino
acid sequence (of about 10 amino acids), of amino acids 100 through the carboxyl terminal amino acid of the
101P3A11 protein are embodiments of the invention. It is understood that each particular amino acid
position discloses that position plus or minus five amino acid residues.
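The enumeration above follows a regular pattern, which the following Python sketch reproduces under stated assumptions: a protein length of 317 residues (the upper bound recited in the embodiments above), a step of 10, and a tolerance of five residues around each named position. The function names are hypothetical and the sketch only lists coordinates; it does not use the actual sequence of Figure 2 or Figure 3.

```python
def fragment_windows(length: int, step: int = 10) -> list[tuple[int, int]]:
    """1-based (start, end) pairs: 1-10, 10-20, 20-30, ..., ending at `length`."""
    bounds = [1] + list(range(step, length, step)) + [length]
    return list(zip(bounds[:-1], bounds[1:]))

def with_tolerance(position: int, length: int, tol: int = 5) -> range:
    """Positions covered by 'position plus or minus `tol` residues', clipped to the protein."""
    return range(max(1, position - tol), min(length, position + tol) + 1)

windows = fragment_windows(317)
print(windows[:3])                      # [(1, 10), (10, 20), (20, 30)]
print(windows[-1])                      # (310, 317)
print(list(with_tolerance(10, 317)))    # positions 5 through 15
```

Each window boundary is shared with its neighbor, matching the "1 to 10, 10 to 20" wording of the text rather than a non-overlapping partition.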

Polynucleotides encoding relatively long portions of the 101P3A11 protein are also within the scope of the
invention. For example, polynucleotides encoding from about amino acid 1 (or 20 or 30 or 40 etc.) to about amino
acid 20 (or 30, or 40 or 50 etc.) of the 101P3A11 protein shown in Figure 2 or Figure 3 can be generated by a
variety of techniques well known in the art. These polynucleotide fragments can include any portion of the
101P3A11 sequence as shown in Figure 2 or Figure 3.

Additional illustrative embodiments of the invention disclosed herein include 101P3A11 polynucleotide
fragments encoding one or more of the biological motifs contained within the 101P3A11 protein sequence,
including one or more of the subsequences of the 101P3A11 protein set forth in Tables V-XVIII or XXVI-XXVIII.
In another embodiment, typical polynucleotide fragments of the invention encode one or more of the regions of
101P3A11 that exhibit homology to a known molecule. In another embodiment of the invention, typical
polynucleotide fragments can encode one or more of the 101P3A11 N-glycosylation sites, cAMP and cGMP-dependent
protein kinase phosphorylation sites, casein kinase II phosphorylation sites or N-myristoylation site and
amidation sites.
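As a hedged illustration of how such motif sites might be located in a protein sequence, the following Python sketch scans for four of the site types named above. The regular expressions are PROSITE-style consensus patterns assumed here for illustration; the source does not state which pattern definitions were used, and the peptide is a made-up example rather than the 101P3A11 sequence.

```python
import re

# Assumed PROSITE-style consensus patterns, rewritten as regular expressions.
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "cAMP/cGMP-dependent kinase phosphorylation": r"[RK]{2}.[ST]",
    "casein kinase II phosphorylation": r"[ST].{2}[DE]",
    "amidation": r".G[RK][RK]",
}

def find_motifs(protein: str) -> dict[str, list[int]]:
    """Map each motif name to the 1-based start positions of its matches."""
    hits = {}
    for name, pattern in MOTIFS.items():
        # A zero-width lookahead lets overlapping occurrences be reported.
        hits[name] = [m.start() + 1 for m in re.finditer(f"(?=({pattern}))", protein)]
    return hits

toy_peptide = "MKRASNGTLSAEEGKKA"  # invented sequence for demonstration only
for motif, positions in find_motifs(toy_peptide).items():
    print(motif, positions)
```

A pattern match indicates only a candidate site; whether a site is actually modified in a cell is a separate experimental question.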


II.A.) Uses of 101P3A11 Polynucleotides
II.A.1.) Monitoring of Genetic Abnormalities

The polynucleotides of the preceding paragraphs have a number of different specific uses. The human
101P3A11 gene maps to the chromosomal location set forth in Table XXV. For example, because the 101P3A11
gene maps to this chromosome, polynucleotides that encode different regions of the 101P3A11 protein are used to
characterize cytogenetic abnormalities of this chromosomal locale, such as abnormalities that are identified as being
associated with various cancers. In certain genes, a variety of chromosomal abnormalities including rearrangements
have been identified as frequent cytogenetic abnormalities in a number of different cancers (see e.g. Krajinovic et
al., Mutat. Res. 382(3-4): 81-83 (1998); Johansson et al., Blood 86(10): 3905-3914 (1995) and Finger et al.,
P.N.A.S. 85(23): 9158-9162 (1988)). Thus, polynucleotides encoding specific regions of the 101P3A11 protein
provide new tools that can be used to delineate, with greater precision than previously possible, cytogenetic
abnormalities in the chromosomal region that encodes 101P3A11 that may contribute to the malignant phenotype.
In this context, these polynucleotides satisfy a need in the art for expanding the sensitivity of chromosomal
screening in order to identify more subtle and less common chromosomal abnormalities (see e.g. Evans et al., Am.
J. Obstet. Gynecol 171(4): 1055-1057 (1994)).

Furthermore, as 101P3A11 was shown to be highly expressed in prostate and other cancers, 101P3A11
polynucleotides are used in methods assessing the status of 101P3A11 gene products in normal versus cancerous
tissues. Typically, polynucleotides that encode specific regions of the 101P3A11 protein are used to assess the
presence of perturbations (such as deletions, insertions, point mutations, or alterations resulting in a loss of an
antigen etc.) in specific regions of the 101P3A11 gene, such as regions containing one or more motifs.
Exemplary assays include both RT-PCR assays as well as single-strand conformation polymorphism (SSCP)
analysis (see, e.g., Marrogi et al., J. Cutan. Pathol. 26(8): 369-378 (1999)), both of which utilize polynucleotides
encoding specific regions of a protein to examine these regions within the protein.

II.A.2.) Antisense Embodiments

Other specifically contemplated nucleic acid related embodiments of the invention disclosed herein are
genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as nucleic acid molecules based on an alternative
backbone, or including alternative bases, whether derived from natural sources or synthesized, and include molecules
capable of inhibiting the RNA or protein expression of 101P3A11. For example, antisense molecules can be RNAs or
other molecules, including peptide nucleic acids (PNAs) or non-nucleic acid molecules such as phosphorothioate
derivatives, that specifically bind DNA or RNA in a base pair-dependent manner. A skilled artisan can readily obtain
these classes of nucleic acid molecules using the 101P3A11 polynucleotides and polynucleotide sequences disclosed
herein.

Antisense technology entails the administration of exogenous oligonucleotides that bind to a target
polynucleotide located within the cells. The term "antisense" refers to the fact that such oligonucleotides are
complementary to their intracellular targets, e.g., 101P3A11. See for example, Jack Cohen, Oligodeoxynucleotides,
Antisense Inhibitors of Gene Expression, CRC Press, 1989; and Synthesis 1:1-5 (1988). The 101P3A11 antisense
oligonucleotides of the present invention include derivatives such as S-oligonucleotides (phosphorothioate
derivatives or S-oligos, see, Jack Cohen, supra), which exhibit enhanced cancer cell growth inhibitory action.
S-oligos (nucleoside phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in which a
nonbridging oxygen atom of the phosphate group is replaced by a sulfur atom. The S-oligos of the present
invention can be prepared by treatment of the corresponding O-oligos with 3H-1,2-benzodithiol-3-one-1,1-dioxide,
which is a sulfur transfer reagent. See Iyer, R. P. et al., J. Org. Chem. 55:4693-4698 (1990); and Iyer, R. P. et al., J.
Am. Chem. Soc. 112:1253-1254 (1990). Additional 101P3A11 antisense oligonucleotides of the present invention
include morpholino antisense oligonucleotides known in the art (see, e.g., Partridge et al., 1996, Antisense &
Nucleic Acid Drug Development 6: 169-175).

The 101P3A11 antisense oligonucleotides of the present invention typically can be RNA or DNA that is
complementary to and stably hybridizes with the first 100 5' codons or last 100 3' codons of the 101P3A11
genomic sequence or the corresponding mRNA. Absolute complementarity is not required, although high degrees
of complementarity are preferred. Use of an oligonucleotide complementary to this region allows for the selective
hybridization to 101P3A11 mRNA and not to mRNA specifying other regulatory subunits of protein kinase. In one
embodiment, 101P3A11 antisense oligonucleotides of the present invention are 15 to 30-mer fragments of the
antisense DNA molecule that have a sequence that hybridizes to 101P3A11 mRNA. Optionally, the 101P3A11
antisense oligonucleotide is a 30-mer oligonucleotide that is complementary to a region in the first 10 5' codons or
last 10 3' codons of 101P3A11. Alternatively, the antisense molecules are modified to employ ribozymes in the
inhibition of 101P3A11 expression (see, e.g., L. A. Couture & D. T. Stinchcomb, Trends Genet 12: 510-515
(1996)).
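The relationship between a target region and its antisense oligonucleotide can be sketched in Python as follows. This is a minimal illustration, assuming an unmodified DNA oligonucleotide and perfect complementarity (which the text notes is not required); the open reading frame is invented and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def antisense_for_first_codons(coding_sequence: str, n_codons: int = 10) -> str:
    """Antisense DNA oligo (5'->3') complementary to the first `n_codons` codons."""
    dna = coding_sequence.upper().replace("U", "T")  # accept mRNA or cDNA input
    target = dna[: 3 * n_codons]
    return target.translate(COMPLEMENT)[::-1]

def sub_oligos(oligo: str, size: int) -> list[str]:
    """All contiguous fragments of `size` nucleotides within `oligo`."""
    return [oligo[i : i + size] for i in range(len(oligo) - size + 1)]

# Made-up 11-codon open reading frame, used only to exercise the functions.
toy_cds = "ATGGCTAGCAAGGAGCTGACCTTCGGAAATTGA"
oligo_30 = antisense_for_first_codons(toy_cds, n_codons=10)  # a 30-mer
fifteen_mers = sub_oligos(oligo_30, 15)                      # overlapping 15-mers
print(len(oligo_30), len(fifteen_mers))
```

Ten codons span 30 nucleotides, which is why a 30-mer covers the first 10 codons exactly; shorter 15-mer fragments are drawn from within that span.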

II.A.3.) Primers and Primer Pairs 

Further specific embodiments of the nucleotides of the invention include primers and primer pairs, which
allow the specific amplification of polynucleotides of the invention or of any specific parts thereof, and probes that
selectively or specifically hybridize to nucleic acid molecules of the invention or to any part thereof. Probes can be
labeled with a detectable marker, such as, for example, a radioisotope, fluorescent compound, bioluminescent
compound, a chemiluminescent compound, metal chelator or enzyme. Such probes and primers are used to detect
the presence of a 101P3A11 polynucleotide in a sample and as a means for detecting a cell expressing a 101P3A11
protein.

Examples of such probes include polynucleotides comprising all or part of the human 101P3A11 cDNA sequence
shown in Figure 2. Examples of primer pairs capable of specifically amplifying 101P3A11 mRNAs are also described
in the Examples. As will be understood by the skilled artisan, a great many different primers and probes can be
prepared based on the sequences provided herein and used effectively to amplify and/or detect a 101P3A11 mRNA.

The 101P3A11 polynucleotides of the invention are useful for a variety of purposes, including but not
limited to their use as probes and primers for the amplification and/or detection of the 101P3A11 gene(s),
mRNA(s), or fragments thereof; as reagents for the diagnosis and/or prognosis of prostate cancer and other cancers;
as coding sequences capable of directing the expression of 101P3A11 polypeptides; as tools for modulating or
inhibiting the expression of the 101P3A11 gene(s) and/or translation of the 101P3A11 transcript(s); and as
therapeutic agents.

The present invention includes the use of any probe as described herein to identify and isolate a 101P3A11 or
101P3A11 related nucleic acid sequence from a naturally occurring source, such as humans or other mammals, as well
as the isolated nucleic acid sequence per se, which would comprise all or most of the sequences found in the probe used.

II.A.4.) Isolation of 101P3A11-Encoding Nucleic Acid Molecules

The 101P3A11 cDNA sequences described herein enable the isolation of other polynucleotides encoding
101P3A11 gene product(s), as well as the isolation of polynucleotides encoding 101P3A11 gene product homologs,
alternatively spliced isoforms, allelic variants, and mutant forms of the 101P3A11 gene product as well as
polynucleotides that encode analogs of 101P3A11-related proteins. Various molecular cloning methods that can be
employed to isolate full length cDNAs encoding a 101P3A11 gene are well known (see, for example, Sambrook, J. et
al., Molecular Cloning: A Laboratory Manual, 2d edition, Cold Spring Harbor Press, New York, 1989; Current
Protocols in Molecular Biology. Ausubel et al., Eds., Wiley and Sons, 1995). For example, lambda phage cloning
methodologies can be conveniently employed, using commercially available cloning systems (e.g., Lambda ZAP
Express, Stratagene). Phage clones containing 101P3A11 gene cDNAs can be identified by probing with a labeled
101P3A11 cDNA or a fragment thereof. For example, in one embodiment, the 101P3A11 cDNA (Figure 2) or a portion
thereof can be synthesized and used as a probe to retrieve overlapping and full-length cDNAs corresponding to a
101P3A11 gene. The 101P3A11 gene itself can be isolated by screening genomic DNA libraries, bacterial artificial
chromosome libraries (BACs), yeast artificial chromosome libraries (YACs), and the like, with 101P3A11 DNA probes
or primers.

II.A.5.) Recombinant Nucleic Acid Molecules and Host-Vector Systems

The invention also provides recombinant DNA or RNA molecules containing a 101P3A11 polynucleotide, a
fragment, analog or homologue thereof, including but not limited to phages, plasmids, phagemids, cosmids, YACs,
BACs, as well as various viral and non-viral vectors well known in the art, and cells transformed or transfected with
such recombinant DNA or RNA molecules. Methods for generating such molecules are well known (see, for example,
Sambrook et al., 1989, supra).

The invention further provides a host-vector system comprising a recombinant DNA molecule containing a
101P3A11 polynucleotide, fragment, analog or homologue thereof within a suitable prokaryotic or eukaryotic host
cell. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a
mammalian cell or an insect cell (e.g., a baculovirus-infectible cell such as an Sf9 or HighFive cell). Examples of
suitable mammalian cells include various prostate cancer cell lines such as DU145 and TsuPr1, other transfectable
or transducible prostate cancer cell lines, primary cells (PrEC), as well as a number of mammalian cells routinely
used for the expression of recombinant proteins (e.g., COS, CHO, 293, 293T cells). More particularly, a
polynucleotide comprising the coding sequence of 101P3A11 or a fragment, analog or homolog thereof can be used to
generate 101P3A11 proteins or fragments thereof using any number of host-vector systems routinely used and widely
known in the art.

A wide range of host-vector systems suitable for the expression of 101P3A11 proteins or fragments thereof are
available (see, for example, Sambrook et al., 1989, supra; Current Protocols in Molecular Biology, 1995, supra).
Preferred vectors for mammalian expression include but are not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and
the retroviral vector pSRαtkneo (Muller et al., 1991, MCB 11:1785). Using these expression vectors, 101P3A11
can be expressed in several prostate cancer and non-prostate cell lines, including for example 293, 293T, rat-1, NIH
3T3 and TsuPr1. The host-vector systems of the invention are useful for the production of a 101P3A11 protein or
fragment thereof. Such host-vector systems can be employed to study the functional properties of 101P3A11 and
101P3A11 mutations or analogs.

Recombinant human 101P3A11 protein or an analog or homolog or fragment thereof can be produced by
mammalian cells transfected with a construct encoding a 101P3A11-related nucleotide. For example, 293T cells
can be transfected with an expression plasmid encoding 101P3A11 or fragment, analog or homolog thereof, the
101P3A11 or related protein is expressed in the 293T cells, and the recombinant 101P3A11 protein is isolated using
standard purification methods (e.g., affinity purification using anti-101P3A11 antibodies). In another embodiment,
a 101P3A11 coding sequence is subcloned into the retroviral vector pSRαMSVtkneo and used to infect various
mammalian cell lines, such as NIH 3T3, TsuPr1, 293 and rat-1 in order to establish 101P3A11 expressing cell lines.
Various other expression systems well known in the art can also be employed. Expression constructs encoding a
leader peptide joined in frame to the 101P3A11 coding sequence can be used for the generation of a secreted form
of recombinant 101P3A11 protein.

As discussed herein, redundancy in the genetic code permits variation in 101P3A11 gene sequences. In
particular, it is known in the art that specific host species often have specific codon preferences, and thus one can
adapt the disclosed sequence as preferred for a desired host. For example, preferred analog codon sequences
typically have rare codons (i.e., codons having a usage frequency of less than about 20% in known sequences of the
desired host) replaced with higher frequency codons. Codon preferences for a specific species are calculated, for
example, by utilizing codon usage tables available on the INTERNET such as at URL
www.dna.affrc.go.jp/~nakamura/codon.html.
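The rare-codon replacement described above can be sketched in Python. This is an illustration under stated assumptions: the usage fractions below are invented for a hypothetical host and cover only three amino acids, "usage frequency" is interpreted as the fraction of use among synonymous codons, and each rare codon is replaced with the single most frequent synonym. Real values would come from a codon usage table for the chosen host.

```python
# Hypothetical usage fractions (per amino acid) for a made-up host organism.
USAGE = {
    "L": {"CTG": 0.40, "CTC": 0.20, "TTG": 0.13, "CTT": 0.13, "TTA": 0.07, "CTA": 0.07},
    "R": {"AGA": 0.22, "AGG": 0.20, "CGG": 0.20, "CGC": 0.19, "CGA": 0.11, "CGT": 0.08},
    "M": {"ATG": 1.00},
}
CODON_TO_AA = {codon: aa for aa, codons in USAGE.items() for codon in codons}

def replace_rare_codons(cds: str, threshold: float = 0.20) -> str:
    """Swap codons used below `threshold` for the most frequent synonymous codon."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    optimized = []
    for i in range(0, len(cds), 3):
        codon = cds[i : i + 3].upper()
        aa = CODON_TO_AA.get(codon)  # codons missing from the toy table are left unchanged
        if aa is not None and USAGE[aa][codon] < threshold:
            codon = max(USAGE[aa], key=USAGE[aa].get)
        optimized.append(codon)
    return "".join(optimized)

print(replace_rare_codons("ATGTTACGTCTG"))  # ATGCTGAGACTG
```

Because only synonymous codons are exchanged, the encoded amino acid sequence is unchanged.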

Additional sequence modifications are known to enhance protein expression in a cellular host. These
include elimination of sequences encoding spurious polyadenylation signals, exon/intron splice site signals,
transposon-like repeats, and/or other such well-characterized sequences that are deleterious to gene expression. The
GC content of the sequence is adjusted to levels average for a given cellular host, as calculated by reference to
known genes expressed in the host cell. Where possible, the sequence is modified to avoid predicted hairpin
secondary mRNA structures. Other useful modifications include the addition of a translational initiation consensus
sequence at the start of the open reading frame, as described in Kozak, Mol. Cell Biol., 9:5073-5080 (1989). Skilled
artisans understand that the general rule that eukaryotic ribosomes initiate translation exclusively at the 5' proximal
AUG codon is abrogated only under rare conditions (see, e.g., Kozak PNAS 92(7): 2662-2666, (1995) and Kozak
NAR 15(20): 8125-8148 (1987)).
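Two of the sequence checks mentioned above lend themselves to a short Python sketch: computing GC content and locating candidate polyadenylation signals. The hexamer AATAAA is used as the assumed signal for illustration; the source does not specify which signal sequences are screened, and the function names are hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C nucleotides in the sequence (0.0 for an empty string)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)

def find_signal(sequence: str, signal: str = "AATAAA") -> list[int]:
    """1-based start positions of every occurrence of `signal`, overlaps included."""
    seq = sequence.upper().replace("U", "T")
    positions = []
    start = seq.find(signal)
    while start != -1:
        positions.append(start + 1)
        start = seq.find(signal, start + 1)
    return positions

print(gc_content("ATGC"))         # 0.5
print(find_signal("GGAATAAACC"))  # [3]
```

The computed GC fraction would then be compared against an average derived from known genes expressed in the chosen host, as the text describes.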

III.) 101P3A11-related Proteins

Another aspect of the present invention provides 101P3A11-related proteins. Specific embodiments of
101P3A11 proteins comprise a polypeptide having all or part of the amino acid sequence of human 101P3A11 as
shown in Figure 2 or Figure 3. Alternatively, embodiments of 101P3A11 proteins comprise variant, homolog or
analog polypeptides that have alterations in the amino acid sequence of 101P3A11 shown in Figure 2 or Figure 3.

In general, naturally occurring allelic variants of human 101P3A11 share a high degree of structural identity
and homology (e.g., 90% or more homology). Typically, allelic variants of the 101P3A11 protein contain conservative
amino acid substitutions within the 101P3A11 sequences described herein or contain a substitution of an amino acid
from a corresponding position in a homologue of 101P3A11. One class of 101P3A11 allelic variants are proteins that
share a high degree of homology with at least a small region of a particular 101P3A11 amino acid sequence, but further
contain a radical departure from the sequence, such as a non-conservative substitution, truncation, insertion or frame
shift. In comparisons of protein sequences, the terms similarity, identity, and homology each have a distinct meaning as
appreciated in the field of genetics. Moreover, orthology and paralogy can be important concepts describing the
relationship of members of a given protein family in one organism to the members of the same family in other
organisms.

Amino acid abbreviations are provided in Table II. Conservative amino acid substitutions can frequently
be made in a protein without altering either the conformation or the function of the protein. Proteins of the
invention can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 conservative substitutions. Such changes
include substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these hydrophobic amino
acids; aspartic acid (D) for glutamic acid (E) and vice versa; glutamine (Q) for asparagine (N) and vice versa; and
serine (S) for threonine (T) and vice versa. Other substitutions can also be considered conservative, depending on
the environment of the particular amino acid and its role in the three-dimensional structure of the protein. For
example, glycine (G) and alanine (A) can frequently be interchangeable, as can alanine (A) and valine (V).
Methionine (M), which is relatively hydrophobic, can frequently be interchanged with leucine and isoleucine, and
sometimes with valine. Lysine (K) and arginine (R) are frequently interchangeable in locations in which the
significant feature of the amino acid residue is its charge and the differing pK's of these two amino acid residues are
not significant. Still other changes can be considered "conservative" in particular environments (see, e.g. Table III
herein; pages 13-15 "Biochemistry" 2nd ED. Lubert Stryer ed (Stanford University); Henikoff et al., PNAS 1992
Vol 89 10915-10919; Lei et al., J Biol Chem 1995 May 19; 270(20): 11882-6).
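The substitution classes listed in the preceding paragraph can be encoded as a small lookup, sketched here in Python. The grouping follows the text: the first set holds the exchanges described as conservative outright, and the second holds those described as conservative depending on context. The category labels and function name are hypothetical, and Table III is not reproduced.

```python
from itertools import combinations

# Exchanges the text lists as conservative: I/V/L, D/E, Q/N, S/T.
CORE_GROUPS = [("I", "V", "L"), ("D", "E"), ("Q", "N"), ("S", "T")]
# Exchanges the text describes as frequently or sometimes acceptable, depending on context.
CONTEXT_PAIRS = [("G", "A"), ("A", "V"), ("M", "L"), ("M", "I"), ("M", "V"), ("K", "R")]

CORE = {frozenset(pair) for group in CORE_GROUPS for pair in combinations(group, 2)}
CONTEXT = {frozenset(pair) for pair in CONTEXT_PAIRS}

def classify_substitution(original: str, replacement: str) -> str:
    """Label a single-residue change using the classes described in the text."""
    pair = frozenset((original.upper(), replacement.upper()))
    if len(pair) == 1:
        return "identical"
    if pair in CORE:
        return "conservative"
    if pair in CONTEXT:
        return "context-dependent"
    return "non-conservative"

print(classify_substitution("I", "L"))  # conservative
print(classify_substitution("K", "R"))  # context-dependent
print(classify_substitution("D", "K"))  # non-conservative
```

Using unordered pairs makes each exchange symmetric, which matches the "and vice versa" wording of the text.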

Embodiments of the invention disclosed herein include a wide variety of art-accepted variants or analogs
of 101P3A11 proteins such as polypeptides having amino acid insertions, deletions and substitutions. 101P3A11
variants can be made using methods known in the art such as site-directed mutagenesis, alanine scanning, and PCR
mutagenesis. Site-directed mutagenesis (Carter et al., Nucl. Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids
Res., 10:6487 (1987)), cassette mutagenesis (Wells et al., Gene, 34:315 (1985)), restriction selection mutagenesis
(Wells et al., Philos. Trans. R. Soc. London SerA, 317:415 (1986)) or other known techniques can be performed on
the cloned DNA to produce the 101P3A11 variant DNA.

Scanning amino acid analysis can also be employed to identify one or more amino acids along a
contiguous sequence that is involved in a specific biological activity such as a protein-protein interaction. Among
the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids include alanine,
glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it
eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of the
variant. Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently
found in both buried and exposed positions (Creighton, The Proteins, (W.H. Freeman & Co., N.Y.); Chothia, J.
Mol. Biol., 150:1 (1976)). If alanine substitution does not yield adequate amounts of variant, an isosteric amino
acid can be used.
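At the sequence level, an alanine scan amounts to generating one variant per position, each with a single residue replaced by alanine. The following Python sketch lists those variants for a made-up peptide; the naming convention (original residue, 1-based position, "A") and the function name are assumptions for illustration.

```python
def alanine_scan(peptide: str) -> list[tuple[str, str]]:
    """Return (label, variant) pairs, one per non-alanine position in `peptide`."""
    variants = []
    for index, residue in enumerate(peptide.upper()):
        if residue == "A":
            continue  # already alanine, so nothing to substitute
        label = f"{residue}{index + 1}A"
        variants.append((label, peptide[:index].upper() + "A" + peptide[index + 1 :].upper()))
    return variants

for label, variant in alanine_scan("MKAL"):
    print(label, variant)
```

Each variant would then be assayed for the biological activity of interest to see which positions contribute to it.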

As defined herein, 101P3A11 variants, analogs or homologs have the distinguishing attribute of having at
least one epitope that is "cross reactive" with a 101P3A11 protein having the amino acid sequence of SEQ ID NO:
703. As used in this sentence, "cross reactive" means that an antibody or T cell that specifically binds to a
101P3A11 variant also specifically binds to the 101P3A11 protein having the amino acid sequence of SEQ ID NO:
703. A polypeptide ceases to be a variant of the protein shown in SEQ ID NO: 703 when it no longer contains any
epitope capable of being recognized by an antibody or T cell that specifically binds to the 101P3A11 protein.
Those skilled in the art understand that antibodies that recognize proteins bind to epitopes of varying size, and a
grouping of the order of about four or five amino acids, contiguous or not, is regarded as a typical number of amino
acids in a minimal epitope. See, e.g., Nair et al., J. Immunol 2000 165(12): 6949-6955; Hebbes et al., Mol
Immunol (1989) 26(9):865-73; Schwartz et al., J Immunol (1985) 135(4):2598-608.

Another class of 101P3A11-related protein variants share 70%, 75%, 80%, 85% or 90% or more similarity
with the amino acid sequence of SEQ ID NO: 703 or a fragment thereof. Another specific class of 101P3A11
protein variants or analogs comprise one or more of the 101P3A11 biological motifs described herein or presently
known in the art. Thus, encompassed by the present invention are analogs of 101P3A11 fragments (nucleic or
amino acid) that have altered functional (e.g. immunogenic) properties relative to the starting fragment. It is to be
appreciated that motifs now or which become part of the art are to be applied to the nucleic or amino acid sequences
of Figure 2 or Figure 3.

As discussed herein, embodiments of the claimed invention include polypeptides containing less than the
full amino acid sequence of the 101P3A11 protein shown in Figure 2 or Figure 3. For example, representative
embodiments of the invention comprise peptides/proteins having any 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more
contiguous amino acids of the 101P3A11 protein shown in Figure 2 or Figure 3. With respect to any aspect or
embodiment of the invention, a peptide or peptides may be on one or more than one molecule and may include
"pseudopeptides" made of amino acid-like residues that are linked by non-peptide bonds such as, but not limited to,
-CH2-NH-.

Moreover, representative embodiments of the invention disclosed herein include polypeptides consisting of
about amino acid 1 to about amino acid 10 of the 101P3A11 protein shown in Figure 2 or Figure 3, polypeptides
consisting of about amino acid 10 to about amino acid 20 of the 101P3A11 protein shown in Figure 2 or Figure 3,
polypeptides consisting of about amino acid 20 to about amino acid 30 of the 101P3A11 protein shown in Figure 2
or Figure 3, polypeptides consisting of about amino acid 30 to about amino acid 40 of the 101P3A11 protein shown
in Figure 2 or Figure 3, polypeptides consisting of about amino acid 40 to about amino acid 50 of the 101P3A11
protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 50 to about amino acid 60 of the
101P3A11 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 60 to about amino
acid 70 of the 101P3A11 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 70 to
about amino acid 80 of the 101P3A11 protein shown in Figure 2 or Figure 3, polypeptides consisting of about
amino acid 80 to about amino acid 90 of the 101P3A11 protein shown in Figure 2 or Figure 3, polypeptides
consisting of about amino acid 90 to about amino acid 100 of the 101P3A11 protein shown in Figure 2 or Figure 3,
etc. throughout the entirety of the 101P3A11 amino acid sequence. Moreover, polypeptides consisting of about
amino acid 1 (or 20 or 30 or 40 etc.) to about amino acid 20 (or 130, or 140 or 150 etc.) of the 101P3A11 protein
shown in Figure 2 or Figure 3 are embodiments of the invention. It is to be appreciated that the starting and
stopping positions in this paragraph refer to the specified position as well as that position plus or minus 5 residues.

101P3A11-related proteins are generated using standard peptide synthesis technology or using chemical
cleavage methods well known in the art. Alternatively, recombinant methods can be used to generate nucleic acid
molecules that encode a 101P3A11-related protein. In one embodiment, nucleic acid molecules provide a means to
generate defined fragments of the 101P3A11 protein (or variants, homologs or analogs thereof).

III.A.) Motif-bearing Protein Embodiments

Additional illustrative embodiments of the invention disclosed herein include 101P3A11 polypeptides
comprising the amino acid residues of one or more of the biological motifs contained within the 101P3A11
polypeptide sequence set forth in Figure 2 or Figure 3. Various motifs are known in the art, and a protein can be
evaluated for the presence of such motifs by a number of publicly available Internet sites (see, e.g., URL addresses:
pfam.wustl.edu/; searchlauncher.bcm.tmc.edu/seq-search/struc-predict.html; psort.ims.u-tokyo.ac.jp/;
www.cbs.dtu.dk; www.ebi.ac.uk/interpro/scan.html; www.expasy.ch/tools/scnpsit1.html; Epimatrix™ and Epimer™,
Brown University, www.brown.edu/Research/TB-HIV_I^ and BIMAS,
bimas.dcrt.nih.gov/.).

Motif bearing subsequences of the 101P3A11 protein are set forth and identified in Table XIX.
Table XX sets forth several frequently occurring motifs based on pfam searches (see URL address
pfam.wustl.edu/). The columns of Table XX list (1) motif name abbreviation, (2) percent identity found amongst
the different members of the motif family, (3) motif name or description and (4) most common function; location
information is included if the motif is relevant for location.

Polypeptides comprising one or more of the 101P3A11 motifs discussed above are useful in elucidating the
specific characteristics of a malignant phenotype in view of the observation that the 101P3A11 motifs discussed
above are associated with growth dysregulation and because 101P3A11 is expressed or overexpressed in certain
cancers (See, e.g., Table I). Casein kinase II, cAMP and cGMP-dependent protein kinase, and Protein Kinase C, for
example, are enzymes known to be associated with the development of the malignant phenotype (see e.g. Chen et
al., Lab Invest., 78(2): 165-174 (1998); Gaiddon et al., Endocrinology 136(10): 4331-4338 (1995); Hall et al.,
Nucleic Acids Research 24(6): 1119-1126 (1996); Peterziel et al., Oncogene 18(46): 6322-6329 (1999) and O'Brian,
Oncol. Rep. 5(2): 305-309 (1998)). Moreover, both glycosylation and myristoylation are protein modifications also
associated with cancer and cancer progression (see e.g. Dennis et al., Biochem. Biophys. Acta 1473(1):21-34
(1999); Raju et al., Exp. Cell Res. 235(1): 145-154 (1997)). Amidation is another protein modification also
associated with cancer and cancer progression (see e.g. Treston et al., J. Natl. Cancer Inst. Monogr. (13): 169-175
(1992)).

In another embodiment, proteins of the invention comprise one or more of the immunoreactive epitopes
identified in accordance with art-accepted methods, such as the peptides set forth in Tables V-XVIII or XXVI-
XXVIII. CTL epitopes can be determined using specific algorithms to identify peptides within a 101P3A11 protein
that are capable of optimally binding to specified HLA alleles (e.g., Table IV; Epimatrix™ and Epimer™, Brown
University, URL brown.edu/Research/TB-HIV_Lab/epimatrix/epm and BIMAS, URL bimas.dcrt.nih.gov/).
Moreover, processes for identifying peptides that have sufficient binding affinity for HLA molecules and that are
correlated with being immunogenic epitopes are well known in the art, and are carried out without undue
experimentation. In addition, processes for identifying peptides that are immunogenic epitopes are well known in
the art, and are carried out without undue experimentation either in vitro or in vivo.

Also known in the art are principles for creating analogs of such epitopes in order to modulate
immunogenicity. For example, one begins with an epitope that bears a CTL or HTL motif (see, e.g., the HLA Class
I and HLA Class II motifs/supermotifs of Table IV). The epitope is analoged by substituting out an amino acid at
one of the specified positions, and replacing it with another amino acid specified for that position. For example,
one can substitute out a deleterious residue in favor of any other residue, such as a preferred residue as defined in
Table IV; substitute a less-preferred residue with a preferred residue as defined in Table IV; or substitute an
originally-occurring preferred residue with another preferred residue as defined in Table IV. Substitutions can
occur at primary anchor positions or at other positions in a peptide; see, e.g., Table IV.

A variety of references reflect the art regarding the identification and generation of epitopes in a protein of
interest as well as analogs thereof. See, for example, WO 9733602 to Chesnut et al.; Sette, Immunogenetics 1999
50(3-4): 201-212; Sette et al., J. Immunol. 2001 166(2): 1389-1397; Sidney et al., Hum. Immunol. 1997 58(1): 12-20;
Kondo et al., Immunogenetics 1997 45(4): 249-258; Sidney et al., J. Immunol. 1996 157(8): 3480-90; Falk
et al., Nature 351: 290-6 (1991); Hunt et al., Science 255: 1261-3 (1992); Parker et al., J. Immunol. 149: 3580-7
(1992); Parker et al., J. Immunol. 152: 163-75 (1994); Kast et al., 1994 152(8): 3904-12; Borras-Cuesta et al., Hum.
Immunol. 2000 61(3): 266-278; Alexander et al., J. Immunol. 2000 164(3): 1625-1633; Alexander et al.,
PMID: 7895164, UI: 95202582; O'Sullivan et al., J. Immunol. 1991 147(8): 2663-2669; Alexander et al., Immunity
1994 1(9): 751-761 and Alexander et al., Immunol. Res. 1998 18(2): 79-92.

Related embodiments of the invention include polypeptides comprising combinations of the different
motifs set forth in Table XIX, and/or one or more of the predicted CTL epitopes of Table V through Table XVIII,
and/or one or more of the T cell binding motifs known in the art. Preferred embodiments contain no insertions,
deletions or substitutions either within the motifs or the intervening sequences of the polypeptides. In addition,
embodiments which include a number of N-terminal and/or C-terminal amino acid residues on either side of
these motifs may be desirable (to, for example, include a greater portion of the polypeptide architecture in which the
motif is located). Typically the number of N-terminal and/or C-terminal amino acid residues on either side of a
motif is between about 1 and about 100 amino acid residues, preferably between about 5 and about 50 amino acid residues.

101P3A11-related proteins are embodied in many forms, preferably in isolated form. A purified 101P3A11
protein molecule will be substantially free of other proteins or molecules that impair the binding of 101P3A11 to
antibody, T cell or other ligand. The nature and degree of isolation and purification will depend on the intended use.
Embodiments of 101P3A11-related proteins include purified 101P3A11-related proteins and functional, soluble
101P3A11-related proteins. In one embodiment, a functional, soluble 101P3A11 protein or fragment thereof retains
the ability to be bound by antibody, T cell or other ligand.

The invention also provides 101P3A11 proteins comprising biologically active fragments of the 101P3A11
amino acid sequence shown in Figure 2 or Figure 3. Such proteins exhibit properties of the 101P3A11 protein, such
as the ability to elicit the generation of antibodies that specifically bind an epitope associated with the 101P3A11
protein; to be bound by such antibodies; to elicit the activation of HTL or CTL; and/or to be recognized by HTL or
CTL.

101P3A11-related polypeptides that contain particularly interesting structures can be predicted and/or
identified using various analytical techniques well known in the art, including, for example, the methods of
Chou-Fasman, Garnier-Robson, Eisenberg, Karplus-Schultz or Jameson-Wolf analysis, Hopp T.P. and Woods K.R., Kyte J.
and Doolittle R.F., Janin J., Bhaskaran R. and Ponnuswamy P.K., Deleage G. and Roux B., or on the basis of
immunogenicity. Fragments that contain such structures are particularly useful in generating subunit-specific
anti-101P3A11 antibodies or T cells, or in identifying cellular factors that bind to 101P3A11.

CTL epitopes can be determined using specific algorithms to identify peptides within a 101P3A11 protein
that are capable of optimally binding to specified HLA alleles (e.g., by using the SYFPEITHI site at World Wide Web
URL syfpeithi.bmi-heidelberg.com/; the listings in Table IV(A)-(E); Epimatrix™ and Epimer™, Brown University,
URL www.brown.edu/Research/TB-HIV_Lab/epimatrix/epimatm; and BIMAS, URL bimas.dcrt.nih.gov/).
Illustrating this, peptide epitopes from 101P3A11 that are presented in the context of human MHC class I molecules
HLA-A1, A2, A3, A11, A24, B7 and B35 were predicted (Tables V-XVIII). Specifically, the complete amino acid
sequence of the 101P3A11 protein was entered into the HLA Peptide Motif Search algorithm found in the
Bioinformatics and Molecular Analysis Section (BIMAS) web site listed above. The HLA peptide motif search
algorithm was developed by Dr. Ken Parker based on binding of specific peptide sequences in the groove of HLA
Class I molecules, in particular HLA-A2 (see, e.g., Falk et al., Nature 351: 290-6 (1991); Hunt et al., Science
255: 1261-3 (1992); Parker et al., J. Immunol. 149: 3580-7 (1992); Parker et al., J. Immunol. 152: 163-75 (1994)).

This algorithm allows location and ranking of 8-mer, 9-mer, and 10-mer peptides from a complete protein sequence
for predicted binding to HLA-A2 as well as numerous other HLA Class I molecules. Many HLA class I binding
peptides are 8-, 9-, 10- or 11-mers. For example, for class I HLA-A2, the epitopes preferably contain a leucine (L)
or methionine (M) at position 2 and a valine (V) or leucine (L) at the C-terminus (see, e.g., Parker et al., J.
Immunol. 149: 3580-7 (1992)). Selected results of 101P3A11 predicted binding peptides are shown in Tables V-
XVIII or XXVI-XXVIII herein. In Tables V-XVIII, the top 50 ranking candidates, 9-mers and 10-mers, for each
family member are shown along with their location, the amino acid sequence of each specific peptide, and an
estimated binding score. The binding score corresponds to the estimated half time of dissociation of complexes
containing the peptide at 37°C at pH 6.5. Peptides with the highest binding score are predicted to be the most
tightly bound to HLA Class I on the cell surface for the greatest period of time and thus represent the best
immunogenic targets for T-cell recognition.
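
The HLA-A2 anchor preference described above (L or M at position 2, V or L at the C-terminus) can be illustrated with a minimal Python sketch. The function name `find_a2_motif_peptides` and the toy sequence are hypothetical and are not taken from the BIMAS site or from the 101P3A11 sequence; the sketch checks only the two anchor positions and does not compute a binding score.

```python
def find_a2_motif_peptides(protein, lengths=(9, 10)):
    """Return (1-based start position, peptide) pairs whose anchor residues
    match the HLA-A2 preference stated in the text: L or M at position 2
    and V or L at the C-terminus. Illustrative only."""
    hits = []
    for n in lengths:
        # Slide a window of length n across the whole protein sequence.
        for start in range(len(protein) - n + 1):
            peptide = protein[start:start + n]
            if peptide[1] in "LM" and peptide[-1] in "VL":
                hits.append((start + 1, peptide))
    return hits


# Toy sequence (an assumption, not a real protein).
toy_protein = "KLAAAAAAVG"
a2_hits = find_a2_motif_peptides(toy_protein)
```

On the toy sequence only the 9-mer starting at position 1 carries both anchors; the 10-mer ends in G and is rejected.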
In the BIMAS site, the algorithm used to score each 8-mer, 9-mer, or 10-mer peptide subsequence is as
follows: The initial (running) score is set to 1.0. For each residue position, the program examines which amino
acid appears at that position. The running score is then multiplied by the coefficient for that amino acid type, at that
position, for the chosen HLA molecule. These coefficients were pre-calculated and were stored for use by the
scoring algorithm in a separate directory as a collection of HLA coefficient files. Using 9-mers, nine
multiplications are performed. Using 10-mers, nine multiplications are again performed, because the residue lying
at the fifth position in the subsequence is skipped. The resulting running score is multiplied by a final constant to
yield an estimate of the half time of dissociation. (This constant is stored at the end of the coefficient file for the
HLA molecule. It can have a different value for each HLA molecule.) The final multiplication yields the score
reported in the output table. Using 8-mers, eight multiplications are performed instead of nine, with a different
coefficient matrix employed (20-by-8 rather than 20-by-9). For each HLA molecule evaluated by the BIMAS site,
the coefficient values discussed above are stored in a file in a separate directory of coefficient files. When the user
selects the HLA molecule type (or, on the restricted Web page for advanced users, selects the actual filename from
the coefficient filename menu) and selects "9-mer" or "10-mer" as the length of the subsequence, the "standard" file
containing the 20-by-9 coefficient matrix for the HLA molecule is read on-the-fly, with the 181 values (180
coefficient values plus one final constant) being read into an internal array for use by the scoring program. If the
user selected "8-mer" as the subsequence length, then the program proceeds in the same fashion, but reads in a
different coefficient file appropriate for 8-mer searches on that selected HLA molecule type.
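
The multiplicative scoring scheme described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the BIMAS program itself: the function name `bimas_style_score`, the toy coefficient values, the toy final constant, and the choice to treat unlisted residues as a neutral coefficient of 1.0 are all hypothetical, and the actual coefficient files are not reproduced here.

```python
def bimas_style_score(peptide, coeffs, final_constant):
    """Multiply position-specific coefficients, then a final constant.

    coeffs is a list with one dict per scored position, mapping an amino
    acid letter to its coefficient (a stand-in for the 20-by-9 or 20-by-8
    matrix). For a 10-mer, the fifth residue is skipped so that nine
    multiplications are performed, as the text describes."""
    if len(peptide) == 10:
        residues = peptide[:4] + peptide[5:]  # drop the fifth residue
    else:
        residues = peptide
    if len(residues) != len(coeffs):
        raise ValueError("coefficient matrix does not match peptide length")
    score = 1.0  # initial running score
    for position, amino_acid in enumerate(residues):
        # Assumption: residues absent from the toy matrix are neutral (1.0).
        score *= coeffs[position].get(amino_acid, 1.0)
    return score * final_constant


# Toy 9-position matrix and constant (illustrative values only).
TOY_COEFFS = [{} for _ in range(9)]
TOY_COEFFS[1] = {"L": 2.0, "M": 1.5}   # position 2
TOY_COEFFS[8] = {"V": 3.0, "L": 2.0}   # position 9 (C-terminus)
TOY_FINAL_CONSTANT = 0.5

nine_mer_score = bimas_style_score("KLAAAAAAV", TOY_COEFFS, TOY_FINAL_CONSTANT)
ten_mer_score = bimas_style_score("KLAAGAAAAV", TOY_COEFFS, TOY_FINAL_CONSTANT)
```

With these toy values both peptides score 1.0 x 2.0 x 3.0 x 0.5 = 3.0, because the skipped fifth residue of the 10-mer leaves the same nine scored residues. An 8-mer would need its own eight-position matrix, so passing one with the nine-position matrix raises an error.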

In addition, epitopes were identified from the SYFPEITHI site (SEQ ID NO: 2875) listed above. The
scoring system for the SYFPEITHI site evaluates every amino acid within a given peptide. Individual amino acids
may be given the arbitrary value 1 for amino acids that are only slightly preferred in the respective position; optimal
anchor residues are given the value 15; any value between these two is possible. Negative values are also possible
for amino acids which are disadvantageous for the peptide's binding capacity at a certain sequence position. The
allocation of values is based on the frequency of the respective amino acid in natural ligands, T-cell epitopes, or
binding peptides. Further explanations on the algorithm can be found in Rammensee, HG, et al. (1999)
SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics 50: 213-219. In order to have an
extremely reliable SYFPEITHI epitope prediction algorithm, only those MHC class I alleles for which a large
amount of data is available are included in the "epitope prediction" section of SYFPEITHI. A reliability of at least
80% in retrieving the most apt epitope can be expected. Thus the naturally presented epitope should be among the
top-scoring 2% of all peptides predicted in 80% of all predictions. For example, if a protein sequence is as long as
400 amino acids, it can be split into 392 possible nonameric peptides. Therefore, the correct epitope should then be
among the 8 top-scoring peptides. For epitope predictions using MHC class II motifs, high reliabilities usually
cannot be achieved because of the more variable pocket binding behavior. A reliability of only approximately 50%
is expected for HLA class II epitopes.
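
The arithmetic in the 400-residue example above can be checked with a short Python sketch. The function names are hypothetical, and rounding the top 2% up to a whole number of peptides is an assumption made here to match the figure of 8 given in the text.

```python
import math


def count_overlapping_peptides(protein_length, peptide_length=9):
    """Number of overlapping peptides of a given length in a protein."""
    return max(protein_length - peptide_length + 1, 0)


def top_fraction_count(n_peptides, fraction=0.02):
    """Size of the top-scoring fraction, rounded up (an assumption)."""
    return math.ceil(n_peptides * fraction)


nonamers = count_overlapping_peptides(400)   # 400 - 9 + 1
shortlist = top_fraction_count(nonamers)     # 2% of the nonamers, rounded up
```

A 400-residue protein yields 392 nonamers, and 2% of 392 is 7.84, which rounds up to the 8 top-scoring peptides mentioned in the text.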

Also, different MHC class I molecules prefer a different length of ligands. For example, SYFPEITHI
offers predictions for H2-Kb octamers, HLA-A*0201 nonamers and decamers, or HLA-B8 octamers and
nonamers. The maximal scores vary between different MHC alleles. Therefore, one can include known
ligands/epitopes in order to have an approximation of the scoring. For example, the maximal score for HLA-
A*0201 peptides is 36. The well-known epitope GILGFVFTL derived from the influenza A matrix protein scores
30. All predicted MHC class II ligands are 15-mers, consisting of three N-terminal flanking residues, the nonamer
core sequence located within the binding groove, and three C-terminal flanking residues. Thus, anchor residue P1
appears in position 4 of the peptides predicted with "SYFPEITHI".
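
The layout of a predicted class II 15-mer described above (three flanking residues, a nonamer core, three flanking residues) can be sketched as follows. The function name `split_class_ii_15mer` and the toy peptide are hypothetical and serve only to show why anchor residue P1 falls at position 4.

```python
def split_class_ii_15mer(peptide):
    """Split a 15-mer into N-terminal flank, nonamer core and C-terminal
    flank, following the 3 + 9 + 3 layout described in the text."""
    if len(peptide) != 15:
        raise ValueError("expected a 15-residue peptide")
    n_flank = peptide[:3]
    core = peptide[3:12]   # positions 4 through 12 (1-based)
    c_flank = peptide[12:]
    return n_flank, core, c_flank


# Toy peptide (an assumption, not taken from Table XXVIII).
n_flank, core, c_flank = split_class_ii_15mer("AAAKLMNPQRSTGGG")
p1_anchor = core[0]  # same residue as position 4 of the 15-mer
```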

Accordingly, identification of HLA Class I and Class II immunogenic peptides for 101P3A11 was carried
out using the SYFPEITHI database for MHC ligands and the accompanying prediction algorithm (Hans-Georg
Rammensee, Jutta Bachmann, Niels Nikolaus Emmerich, Oskar Alexander Bachor, Stefan Stevanovic:
SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics 1999, 50: 213-219) accessed through
the website with a URL: syfpeithi.bmi-heidelberg.com/scripts/MHCServer.dll/home.htm (Tables XXVI-XXVIII).
For HLA Class I alleles, nonameric (Table XXVI) and decameric peptides (Table XXVII) were identified. Given
the nature of the HLA Class II binding pocket, for HLA Class II alleles sample 15-mer peptides were identified
(Table XXVIII). It is appreciated by those in the art that additional amino acids can be present on the termini of the
15-mer epitopes, or that fewer than 15 amino acids can be present, and that the epitope will still be bound in the
open-ended HLA Class II binding pocket.

For each given HLA allele, listed in Tables XXVI-XXVIII are the peptides whose score ranks within the
top 50% of the highest scoring peptide(s) predicted for that allele. For example, if the highest scoring value for
nonamers of the HLA-A1 allele is 30, all peptides that are given a score of 15 or higher are listed. Listed in the
tables is the amino acid position within the 101P3A11 protein sequence for the first amino acid of each listed
peptide (Pos), the peptide sequence, and the relative ranked score (Score).
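
The listing rule above (keep every peptide scoring at least half of the best score for that allele) can be sketched in Python. The function name `filter_top_half` and the sample rows are hypothetical; they are not entries from Tables XXVI-XXVIII, and the sketch assumes the best score is positive.

```python
def filter_top_half(scored_peptides):
    """Keep (pos, peptide, score) rows whose score is at least 50% of the
    highest score, sorted from highest to lowest score."""
    if not scored_peptides:
        return []
    best = max(score for _, _, score in scored_peptides)
    cutoff = best * 0.5  # e.g. a best score of 30 gives a cutoff of 15
    kept = [row for row in scored_peptides if row[2] >= cutoff]
    return sorted(kept, key=lambda row: row[2], reverse=True)


# Made-up rows for one allele: (Pos, peptide, Score).
sample_rows = [
    (77, "DDDDDDDDD", 14),
    (12, "AAAAAAAAA", 30),
    (40, "CCCCCCCCC", 15),
]
listed = filter_top_half(sample_rows)
```

With a best score of 30, the peptide scoring exactly 15 is retained and the one scoring 14 is dropped, matching the HLA-A1 example in the text.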

Actual binding of peptides to an HLA allele can be evaluated by stabilization of HLA expression on the
antigen-processing defective cell line T2 (see, e.g., Xue et al., Prostate 30: 73-8 (1997) and Peshwa et al., Prostate
36: 129-38 (1998)). Immunogenicity of specific peptides can be evaluated in vitro by stimulation of CD8+ cytotoxic
T lymphocytes (CTL) in the presence of antigen presenting cells such as dendritic cells.

It is to be appreciated that every epitope predicted by the BIMAS site, Epimer™ and Epimatrix™ sites, or
specified by the HLA class I or class II motifs available in the art or which become part of the art, such as set forth
in Table IV (or determined using World Wide Web site URL syfpeithi.bmi-heidelberg.com/), is to be "applied" to
the 101P3A11 protein. As used in this context, "applied" means that the 101P3A11 protein is evaluated, e.g.,
visually or by computer-based pattern-finding methods, as appreciated by those of skill in the relevant art. Every
subsequence of the 101P3A11 protein of 8, 9, 10, or 11 amino acid residues that bears an HLA Class I motif, and every
subsequence of 9 or more amino acid residues that bears an HLA Class II motif, is within the scope of the invention.
III.B.) Expression of 101P3A11-related Proteins

In an embodiment described in the examples that follow, 101P3A11 can be conveniently expressed in cells
(such as 293T cells) transfected with a commercially available expression vector such as a CMV-driven expression
vector encoding 101P3A11 with a C-terminal 6XHis and MYC tag (pcDNA3.1/mycHIS, Invitrogen or Tag5,
GenHunter Corporation, Nashville TN). The Tag5 vector provides an IgGK secretion signal that can be used to
facilitate the production of a secreted 101P3A11 protein in transfected cells. The secreted HIS-tagged 101P3A11 in
the culture media can be purified, e.g., using a nickel column using standard techniques.
III.C.) Modifications of 101P3A11-related Proteins

Modifications of 101P3A11-related proteins such as covalent modifications are included within the scope
of this invention. One type of covalent modification includes reacting targeted amino acid residues of a 101P3A11
polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C-
terminal residues of the 101P3A11. Another type of covalent modification of the 101P3A11 polypeptide included
within the scope of this invention comprises altering the native glycosylation pattern of a protein of the invention.
Another type of covalent modification of 101P3A11 comprises linking the 101P3A11 polypeptide to one of a
variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes,
in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

The 101P3A11-related proteins of the present invention can also be modified to form a chimeric molecule
comprising 101P3A11 fused to another, heterologous polypeptide or amino acid sequence. Such a chimeric
molecule can be synthesized chemically or recombinantly. A chimeric molecule can have a protein of the invention
fused to another tumor-associated antigen or fragment thereof. Alternatively, a protein in accordance with the
invention can comprise a fusion of fragments of the 101P3A11 sequence (amino or nucleic acid) such that a
molecule is created that is not, through its length, directly homologous to the amino or nucleic acid sequences
shown in Figure 2 or Figure 3. Such a chimeric molecule can comprise multiples of the same subsequence of
101P3A11. A chimeric molecule can comprise a fusion of a 101P3A11-related protein with a polyhistidine epitope
tag, which provides an epitope to which immobilized nickel can selectively bind, with cytokines or with growth
factors. The epitope tag is generally placed at the amino- or carboxyl-terminus of the 101P3A11. In an alternative
embodiment, the chimeric molecule can comprise a fusion of a 101P3A11-related protein with an immunoglobulin
or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an
"immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions preferably include
the substitution of a soluble (transmembrane domain deleted or inactivated) form of a 101P3A11 polypeptide in
place of at least one variable region within an Ig molecule. In a preferred embodiment, the immunoglobulin fusion
includes the hinge, CH2 and CH3, or the hinge, CH1, CH2 and CH3 regions of an IgG1 molecule. For the
production of immunoglobulin fusions see, e.g., U.S. Patent No. 5,428,130 issued June 27, 1995.
III.D.) Uses of 101P3A11-related Proteins

The proteins of the invention have a number of different specific uses. As 101P3A11 is highly expressed
in prostate and other cancers, 101P3A11-related proteins are used in methods that assess the status of 101P3A11
gene products in normal versus cancerous tissues, thereby elucidating the malignant phenotype. Typically,
polypeptides from specific regions of the 101P3A11 protein are used to assess the presence of perturbations (such
as deletions, insertions, point mutations etc.) in those regions (such as regions containing one or more motifs).
Exemplary assays utilize antibodies or T cells targeting 101P3A11-related proteins comprising the amino acid
residues of one or more of the biological motifs contained within the 101P3A11 polypeptide sequence in order to
evaluate the characteristics of this region in normal versus cancerous tissues or to elicit an immune response to the
epitope. Alternatively, 101P3A11-related proteins that contain the amino acid residues of one or more of the
biological motifs in the 101P3A11 protein are used to screen for factors that interact with that region of 101P3A11.

101P3A11 protein fragments/subsequences are particularly useful in generating and characterizing domain-
specific antibodies (e.g., antibodies recognizing an extracellular or intracellular epitope of a 101P3A11 protein), for
identifying agents or cellular factors that bind to 101P3A11 or a particular structural domain thereof, and in various
therapeutic and diagnostic contexts, including but not limited to diagnostic assays, cancer vaccines and methods of
preparing such vaccines.

Proteins encoded by the 101P3A11 genes, or by analogs, homologs or fragments thereof, have a variety of
uses, including but not limited to generating antibodies and in methods for identifying ligands and other agents and
cellular constituents that bind to a 101P3A11 gene product. Antibodies raised against a 101P3A11 protein or
fragment thereof are useful in diagnostic and prognostic assays, and imaging methodologies in the management of
human cancers characterized by expression of 101P3A11 protein, such as those listed in Table I. Such antibodies
can be expressed intracellularly and used in methods of treating patients with such cancers. 101P3A11-related
nucleic acids or proteins are also used in generating HTL or CTL responses.

Various immunological assays useful for the detection of 101P3A11 proteins are used, including but not
limited to various types of radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked
immunofluorescent assays (ELIFA), immunocytochemical methods, and the like. Antibodies can be labeled and used as
immunological imaging reagents capable of detecting 101P3A11-expressing cells (e.g., in radioscintigraphic imaging
methods). 101P3A11 proteins are also particularly useful in generating cancer vaccines, as further described herein.
IV.) 101P3A11 Antibodies 

Another aspect of the invention provides antibodies that bind to 101P3A11-related proteins. Preferred
antibodies specifically bind to a 101P3A11-related protein and do not bind (or bind weakly) to peptides or proteins that
are not 101P3A11-related proteins. For example, antibodies that bind 101P3A11 can bind 101P3A11-related proteins such
as the homologs or analogs thereof.

101P3A11 antibodies of the invention are particularly useful in cancer diagnostic and
prognostic assays, and imaging methodologies. Similarly, such antibodies are useful in the treatment, diagnosis,
and/or prognosis of other cancers, to the extent 101P3A11 is also expressed or overexpressed in these other cancers.
Moreover, intracellularly expressed antibodies (e.g., single chain antibodies) are therapeutically useful in treating
cancers in which the expression of 101P3A11 is involved, such as advanced or metastatic prostate cancers.

The invention also provides various immunological assays useful for the detection and quantification of
101P3A11 and mutant 101P3A11-related proteins. Such assays can comprise one or more 101P3A11 antibodies
capable of recognizing and binding a 101P3A11-related protein, as appropriate. These assays are performed within
various immunological assay formats well known in the art, including but not limited to various types of
radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked immunofluorescent assays
(ELIFA), and the like.

Immunological non-antibody assays of the invention also comprise T cell immunogenicity assays (inhibitory
or stimulatory) as well as major histocompatibility complex (MHC) binding assays.

In addition, immunological imaging methods capable of detecting prostate cancer and other cancers expressing
101P3A11 are also provided by the invention, including but not limited to radioscintigraphic imaging methods using
labeled 101P3A11 antibodies. Such assays are clinically useful in the detection, monitoring, and prognosis of
101P3A11-expressing cancers such as prostate cancer.

101P3A11 antibodies are also used in methods for purifying a 101P3A11-related protein and for isolating
101P3A11 homologues and related molecules. For example, a method of purifying a 101P3A11-related protein
comprises incubating a 101P3A11 antibody, which has been coupled to a solid matrix, with a lysate or other solution
containing a 101P3A11-related protein under conditions that permit the 101P3A11 antibody to bind to the
101P3A11-related protein; washing the solid matrix to eliminate impurities; and eluting the 101P3A11-related protein from the
coupled antibody. Other uses of the 101P3A11 antibodies of the invention include generating anti-idiotypic
antibodies that mimic the 101P3A11 protein.

Various methods for the preparation of antibodies are well known in the art. For example, antibodies can be
prepared by immunizing a suitable mammalian host using a 101P3A11-related protein, peptide, or fragment, in isolated
or immunoconjugated form (Antibodies: A Laboratory Manual, CSH Press, Eds., Harlow, and Lane (1988); Harlow,
Antibodies, Cold Spring Harbor Press, NY (1989)). In addition, fusion proteins of 101P3A11 can also be used, such as
a 101P3A11 GST-fusion protein. In a particular embodiment, a GST fusion protein comprising all or most of the amino
acid sequence of Figure 2 or Figure 3 is produced, then used as an immunogen to generate appropriate antibodies. In
another embodiment, a 101P3A11-related protein is synthesized and used as an immunogen.

In addition, naked DNA immunization techniques known in the art are used (with or without purified
101P3A11-related protein or 101P3A11-expressing cells) to generate an immune response to the encoded immunogen
(for review, see Donnelly et al., 1997, Ann. Rev. Immunol. 15: 617-648).

The amino acid sequence of 101P3A11 as shown in Figure 2 or Figure 3 can be analyzed to select specific
regions of the 101P3A11 protein for generating antibodies. For example, hydrophobicity and hydrophilicity analyses of
the 101P3A11 amino acid sequence are used to identify hydrophilic regions in the 101P3A11 structure. Regions of the
101P3A11 protein that show immunogenic structure, as well as other regions and domains, can readily be identified
using various other methods known in the art, such as Chou-Fasman, Garnier-Robson, Eisenberg, Karplus-Schultz or
Jameson-Wolf analysis, Hopp T.P. and Woods K.R., Kyte J. and Doolittle R.F., Janin J., Bhaskaran R. and
Ponnuswamy P.K., or Deleage G. and Roux B. Thus, each region identified by any of these programs or methods is
within the scope of the present invention. Methods for the generation of 101P3A11 antibodies are further illustrated by
way of the examples provided herein. Methods for preparing a protein or polypeptide for use as an immunogen are well
known in the art. Also well known in the art are methods for preparing immunogenic conjugates of a protein with a
carrier, such as BSA, KLH or other carrier protein. In some circumstances, direct conjugation using, for example,
carbodiimide reagents is used; in other instances linking reagents such as those supplied by Pierce Chemical Co.,
Rockford, IL, are effective. Administration of a 101P3A11 immunogen is often conducted by injection over a suitable
time period and with use of a suitable adjuvant, as is understood in the art. During the immunization schedule, titers of
antibodies can be taken to determine adequacy of antibody formation.

101P3A11 monoclonal antibodies can be produced by various means well known in the art. For example,
immortalized cell lines that secrete a desired monoclonal antibody are prepared using the standard hybridoma
technology of Kohler and Milstein or modifications that immortalize antibody-producing B cells, as is generally known.
Immortalized cell lines that secrete the desired antibodies are screened by immunoassay in which the antigen is a
101P3A11-related protein. When the appropriate immortalized cell culture is identified, the cells can be expanded and
antibodies produced either from in vitro cultures or from ascites fluid.

The antibodies or fragments of the invention can also be produced by recombinant means. Regions that bind
specifically to the desired regions of the 101P3A11 protein can also be produced in the context of chimeric or
complementarity determining region (CDR) grafted antibodies of multiple species origin. Humanized or human
101P3A11 antibodies can also be produced, and are preferred for use in therapeutic contexts. Methods for humanizing
murine and other non-human antibodies, by substituting one or more of the non-human antibody CDRs for
corresponding human antibody sequences, are well known (see, for example, Jones et al., 1986, Nature 321: 522-525;
Riechmann et al., 1988, Nature 332: 323-327; Verhoeyen et al., 1988, Science 239: 1534-1536). See also Carter et al.,
1993, Proc. Natl. Acad. Sci. USA 89: 4285 and Sims et al., 1993, J. Immunol. 151: 2296.

Methods for producing fully human monoclonal antibodies include phage display and transgenic methods (for
review, see Vaughan et al., 1998, Nature Biotechnology 16: 535-539). Fully human 101P3A11 monoclonal antibodies
can be generated using cloning technologies employing large human Ig gene combinatorial libraries (i.e., phage display)
(Griffiths and Hoogenboom, Building an in vitro immune system: human antibodies from phage display libraries. In:
Protein Engineering of Antibody Molecules for Prophylactic and Therapeutic Applications in Man, Clark, M. (Ed.),
Nottingham Academic, pp 45-64 (1993); Burton and Barbas, Human Antibodies from combinatorial libraries. Id., pp
65-82). Fully human 101P3A11 monoclonal antibodies can also be produced using transgenic mice engineered to
contain human immunoglobulin gene loci as described in PCT Patent Application WO98/24893, Kucherlapati and
Jakobovits et al., published December 3, 1997 (see also, Jakobovits, 1998, Exp. Opin. Invest. Drugs 7(4): 607-614; U.S.
patents 6,162,963 issued 19 December 2000; 6,150,584 issued 12 November 2000; and 6,114,598 issued 5 September
2000). This method avoids the in vitro manipulation required with phage display technology and efficiently produces
high affinity authentic human antibodies.

Reactivity of 101P3A11 antibodies with a 101P3A11-related protein can be established by a number of
well known means, including Western blot, immunoprecipitation, ELISA, and FACS analyses using, as appropriate,
101P3A11-related proteins, 101P3A11-expressing cells or extracts thereof. A 101P3A11 antibody or fragment
thereof can be labeled with a detectable marker or conjugated to a second molecule. Suitable detectable markers
include, but are not limited to, a radioisotope, a fluorescent compound, a bioluminescent compound, a
chemiluminescent compound, a metal chelator or an enzyme. Further, bi-specific antibodies specific for two or
more 101P3A11 epitopes are generated using methods generally known in the art. Homodimeric antibodies can
also be generated by cross-linking techniques known in the art (e.g., Wolff et al., Cancer Res. 53: 2560-2565).
V.) 1Q1P3A11 Cellular Immune Responses 

The mechanism by which T cells recognize antigens has been delineated. Efficacious peptide epitope
vaccine compositions of the invention induce therapeutic or prophylactic immune responses in very broad
segments of the world-wide population. For an understanding of the value and efficacy of compositions of the
invention that induce cellular immune responses, a brief review of immunology-related technology is provided.

A complex of an HLA molecule and a peptidic antigen acts as the ligand recognized by HLA-restricted T
cells (Buus, S. et al., Cell 47:1071, 1986; Babbitt, B. P. et al., Nature 317:359, 1985; Townsend, A. and Bodmer,
H., Annu. Rev. Immunol. 7:601, 1989; Germain, R. N., Annu. Rev. Immunol. 11:403, 1993). Through the study of
single amino acid substituted antigen analogs and the sequencing of endogenously bound, naturally processed
peptides, critical residues that correspond to motifs required for specific binding to HLA antigen molecules have
been identified and are set forth in Table IV (see also, e.g., Southwood, et al., J. Immunol. 160:3363, 1998;
Rammensee, et al., Immunogenetics 41:178, 1995; Rammensee et al., SYFPEITHI, access via World Wide Web at
URL syfpeithi.bmi-heidelberg.com/; Sette, A. and Sidney, J. Curr. Opin. Immunol. 10:478, 1998; Engelhard, V. H.,
Curr. Opin. Immunol. 6:13, 1994; Sette, A. and Grey, H. M., Curr. Opin. Immunol. 4:79, 1992; Sinigaglia, F. and
Hammer, J. Curr. Biol. 6:52, 1994; Ruppert et al., Cell 74:929-937, 1993; Kondo et al., J. Immunol. 155:4307-4312,
1995; Sidney et al., J. Immunol. 157:3480-3490, 1996; Sidney et al., Human Immunol. 45:79-93, 1996; Sette,
A. and Sidney, J. Immunogenetics 1999 Nov; 50(3-4):201-12, Review).

Furthermore, x-ray crystallographic analyses of HLA-peptide complexes have revealed pockets within the
peptide binding cleft/groove of HLA molecules which accommodate, in an allele-specific mode, residues borne by
peptide ligands; these residues in turn determine the HLA binding capacity of the peptides in which they are
present. (See, e.g., Madden, D.R. Annu. Rev. Immunol. 13:587, 1995; Smith, et al., Immunity 4:203, 1996; Fremont
et al., Immunity 8:305, 1998; Stern et al., Structure 2:245, 1994; Jones, E.Y. Curr. Opin. Immunol. 9:75, 1997;
Brown, J. H. et al., Nature 364:33, 1993; Guo, H. C. et al., Proc. Natl. Acad. Sci. USA 90:8053, 1993; Guo, H. C. et
al., Nature 360:364, 1992; Silver, M. L. et al., Nature 360:367, 1992; Matsumura, M. et al., Science 257:927, 1992;
Madden et al., Cell 70:1035, 1992; Fremont, D. H. et al., Science 257:919, 1992; Saper, M. A., Bjorkman, P. J. and
Wiley, D. C., J. Mol. Biol. 219:277, 1991.)

Accordingly, the definition of class I and class II allele-specific HLA binding motifs, or class I or class II
supermotifs allows identification of regions within a protein that are correlated with binding to particular HLA
antigen(s).

Thus, by a process of HLA motif identification, candidates for epitope-based vaccines have been
identified; such candidates can be further evaluated by HLA-peptide binding assays to determine binding affinity
and/or the time period of association of the epitope and its corresponding HLA molecule. Additional confirmatory
work can be performed to select, amongst these vaccine candidates, epitopes with preferred characteristics in terms
of population coverage, and/or immunogenicity.
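
The motif-identification step described above can be illustrated with a minimal Python sketch. The anchor residues
used here (position 2 and the C-terminal position of a 9-mer) are an illustrative assumption only and are not taken
from Table IV; the function name, the dictionary layout and the example sequence are likewise hypothetical. The
sketch merely lists windows that satisfy the anchors; it says nothing about measured binding affinity.

```python
# Illustrative anchor sets for a 9-residue window (assumption; NOT copied from Table IV).
EXAMPLE_MOTIF = {
    "length": 9,
    "pos2": set("LIVMATQ"),    # residues tolerated at position 2
    "c_term": set("IVMATL"),   # residues tolerated at the C-terminal position
}


def find_motif_peptides(protein, motif=EXAMPLE_MOTIF):
    """Return (1-based start position, peptide) for every window matching both anchors."""
    n = motif["length"]
    hits = []
    for i in range(len(protein) - n + 1):
        peptide = protein[i:i + n]
        if peptide[1] in motif["pos2"] and peptide[-1] in motif["c_term"]:
            hits.append((i + 1, peptide))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence for demonstration; not the 101P3A11 sequence.
    for start, pep in find_motif_peptides("GLGGGGGGVG"):
        print(start, pep)
```

Candidates returned by such a scan would still need the binding assays and immunogenicity evaluation described in
this section before any conclusion could be drawn.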

Various strategies can be utilized to evaluate cellular immunogenicity, including:

1) Evaluation of primary T cell cultures from normal individuals (see, e.g., Wentworth, P. A. et al., Mol.
Immunol. 32:603, 1995; Celis, E. et al., Proc. Natl. Acad. Sci. USA 91:2105, 1994; Tsai, V. et al., J. Immunol.
158:1796, 1997; Kawashima, I. et al., Human Immunol. 59:1, 1998). This procedure involves the stimulation of
peripheral blood lymphocytes (PBL) from normal subjects with a test peptide in the presence of antigen presenting
cells in vitro over a period of several weeks. T cells specific for the peptide become activated during this time and
are detected using, e.g., a lymphokine- or 51Cr-release assay involving peptide sensitized target cells.

2) Immunization of HLA transgenic mice (see, e.g., Wentworth, P. A. et al., J. Immunol. 26:97, 1996;
Wentworth, P. A. et al., Int. Immunol. 8:651, 1996; Alexander, J. et al., J. Immunol. 159:4753, 1997). For example,
in such methods peptides in incomplete Freund's adjuvant are administered subcutaneously to HLA transgenic
mice. Several weeks following immunization, splenocytes are removed and cultured in vitro in the presence of test
peptide for approximately one week. Peptide-specific T cells are detected using, e.g., a 51Cr-release assay
involving peptide sensitized target cells and target cells expressing endogenously generated antigen.

3) Demonstration of recall T cell responses from immune individuals who have been effectively
vaccinated and/or from chronically ill patients (see, e.g., Rehermann, B. et al., J. Exp. Med. 181:1047, 1995;
Doolan, D. L. et al., Immunity 7:97, 1997; Bertoni, R. et al., J. Clin. Invest. 100:503, 1997; Threlkeld, S. C. et al., J.
Immunol. 159:1648, 1997; Diepolder, H. M. et al., J. Virol. 71:6011, 1997). Accordingly, recall responses are
detected by culturing PBL from subjects that have been exposed to the antigen due to disease and thus have
generated an immune response "naturally", or from patients who were vaccinated against the antigen. PBL from
subjects are cultured in vitro for 1-2 weeks in the presence of test peptide plus antigen presenting cells (APC) to
allow activation of "memory" T cells, as compared to "naive" T cells. At the end of the culture period, T cell
activity is detected using assays including 51Cr release involving peptide-sensitized targets, T cell proliferation, or
lymphokine release.

VI.) 101P3A11 Transgenic Animals

Nucleic acids that encode a 101P3A11-related protein can also be used to generate either transgenic
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically useful
reagents. In accordance with established techniques, cDNA encoding 101P3A11 can be used to clone genomic
DNA that encodes 101P3A11. The cloned genomic sequences can then be used to generate transgenic animals
containing cells that express DNA that encodes 101P3A11. Methods for generating transgenic animals, particularly
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent
Nos. 4,736,866 issued 12 April 1988, and 4,870,009 issued 26 September 1989. Typically, particular cells would
be targeted for 101P3A11 transgene incorporation with tissue-specific enhancers.

Transgenic animals that include a copy of a transgene encoding 101P3A11 can be used to examine the
effect of increased expression of DNA that encodes 101P3A11. Such animals can be used as tester animals for
reagents thought to confer protection from, for example, pathological conditions associated with its overexpression.
In accordance with this aspect of the invention, an animal is treated with a reagent and a reduced incidence of a
pathological condition, compared to untreated animals that bear the transgene, would indicate a potential
therapeutic intervention for the pathological condition.

Alternatively, non-human homologues of 101P3A11 can be used to construct a 101P3A11 "knock out"
animal that has a defective or altered gene encoding 101P3A11 as a result of homologous recombination between
the endogenous gene encoding 101P3A11 and altered genomic DNA encoding 101P3A11 introduced into an
embryonic cell of the animal. For example, cDNA that encodes 101P3A11 can be used to clone genomic DNA
encoding 101P3A11 in accordance with established techniques. A portion of the genomic DNA encoding
101P3A11 can be deleted or replaced with another gene, such as a gene encoding a selectable marker that can be
used to monitor integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are
included in the vector (see, e.g., Thomas and Capecchi, Cell, 51:503 (1987) for a description of homologous
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected (see,
e.g., Li et al., Cell, 69:915 (1992)). The selected cells are then injected into a blastocyst of an animal (e.g., a mouse
or rat) to form aggregation chimeras (see, e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152). A chimeric embryo can then be
implanted into a suitable pseudopregnant female foster animal, and the embryo brought to term to create a "knock
out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be identified by
standard techniques and used to breed animals in which all cells of the animal contain the homologously
recombined DNA. Knock out animals can be characterized, for example, for their ability to defend against certain
pathological conditions or for their development of pathological conditions due to absence of the 101P3A11
polypeptide.

VII.) Methods for the Detection of 101P3A11 

Another aspect of the present invention relates to methods for detecting 101P3A11 polynucleotides and
101P3A11-related proteins, as well as methods for identifying a cell that expresses 101P3A11. The expression profile
of 101P3A11 makes it a diagnostic marker for metastasized disease. Accordingly, the status of 101P3A11 gene
products provides information useful for predicting a variety of factors including susceptibility to advanced stage
disease, rate of progression, and/or tumor aggressiveness. As discussed in detail herein, the status of 101P3A11 gene
products in patient samples can be analyzed by a variety of protocols that are well known in the art including
immunohistochemical analysis, the variety of Northern blotting techniques including in situ hybridization, RT-PCR
analysis (for example on laser capture micro-dissected samples), Western blot analysis and tissue array analysis.

More particularly, the invention provides assays for the detection of 101P3A11 polynucleotides in a biological
sample, such as serum, bone, prostate, and other tissues, urine, semen, cell preparations, and the like. Detectable
101P3A11 polynucleotides include, for example, a 101P3A11 gene or fragment thereof, 101P3A11 mRNA, alternative
splice variant 101P3A11 mRNAs, and recombinant DNA or RNA molecules that contain a 101P3A11 polynucleotide.
A number of methods for amplifying and/or detecting the presence of 101P3A11 polynucleotides are well known in the
art and can be employed in the practice of this aspect of the invention.

In one embodiment, a method for detecting a 101P3A11 mRNA in a biological sample comprises producing
cDNA from the sample by reverse transcription using at least one primer; amplifying the cDNA so produced using
101P3A11 polynucleotides as sense and antisense primers to amplify 101P3A11 cDNAs therein; and detecting
the presence of the amplified 101P3A11 cDNA. Optionally, the sequence of the amplified 101P3A11 cDNA can be
determined.

In another embodiment, a method of detecting a 101P3A11 gene in a biological sample comprises first
isolating genomic DNA from the sample; amplifying the isolated genomic DNA using 101P3A11 polynucleotides
as sense and antisense primers; and detecting the presence of the amplified 101P3A11 gene. Any number of
appropriate sense and antisense probe combinations can be designed from the nucleotide sequence provided for
101P3A11 (Figure 2) and used for this purpose.
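
As a hedged illustration of how a sense/antisense primer pair relates to a target sequence, the following Python
sketch takes the sense primer from the 5' end of a chosen region of the coding strand and the antisense primer as the
reverse complement of the 3' end of that region. The function names, the fixed primer length and the toy template are
assumptions made for this example; the sketch does not check melting temperature, specificity or secondary structure,
and it does not use the sequence of Figure 2.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primer_pair(template, start, end, primer_len=20):
    """Pick a sense and an antisense primer flanking template[start:end] (0-based, end exclusive).

    The sense primer is copied from the 5' end of the region on the given strand;
    the antisense primer is the reverse complement of the 3' end of the region.
    """
    region = template[start:end]
    if len(region) < 2 * primer_len:
        raise ValueError("target region too short for two non-overlapping primers")
    sense = region[:primer_len]
    antisense = reverse_complement(region[-primer_len:])
    return sense, antisense


if __name__ == "__main__":
    # Toy template for demonstration only.
    print(primer_pair("ATGCCCGGGTTTAAAC", 0, 16, primer_len=4))
```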

The invention also provides assays for detecting the presence of a 101P3A11 protein in a tissue or other
biological sample such as serum, semen, bone, prostate, urine, cell preparations, and the like. Methods for detecting a
101P3A11-related protein are also well known and include, for example, immunoprecipitation, immunohistochemical
analysis, Western blot analysis, molecular binding assays, ELISA, ELIFA and the like. For example, a method of
detecting the presence of a 101P3A11-related protein in a biological sample comprises first contacting the sample
with a 101P3A11 antibody, a 101P3A11-reactive fragment thereof, or a recombinant protein containing an antigen
binding region of a 101P3A11 antibody; and then detecting the binding of 101P3A11-related protein in the sample.

Methods for identifying a cell that expresses 101P3A11 are also within the scope of the invention. In one
embodiment, an assay for identifying a cell that expresses a 101P3A11 gene comprises detecting the presence of
101P3A11 mRNA in the cell. Methods for the detection of particular mRNAs in cells are well known and include, for
example, hybridization assays using complementary DNA probes (such as in situ hybridization using labeled 101P3A11
riboprobes, Northern blot and related techniques) and various nucleic acid amplification assays (such as RT-PCR using
complementary primers specific for 101P3A11, and other amplification type detection methods, such as, for example,
branched DNA, SISBA, TMA and the like). Alternatively, an assay for identifying a cell that expresses a 101P3A11
gene comprises detecting the presence of 101P3A11-related protein in the cell or secreted by the cell. Various methods
for the detection of proteins are well known in the art and are employed for the detection of 101P3A11-related proteins
and cells that express 101P3A11-related proteins.

101P3A11 expression analysis is also useful as a tool for identifying and evaluating agents that modulate
101P3A11 gene expression. For example, 101P3A11 expression is significantly upregulated in prostate cancer, and
is expressed in cancers of the tissues listed in Table I. Identification of a molecule or biological agent that inhibits
101P3A11 expression or over-expression in cancer cells is of therapeutic value. For example, such an agent can be
identified by using a screen that quantifies 101P3A11 expression by RT-PCR, nucleic acid hybridization or
antibody binding.

VIII.) Methods for Monitoring the Status of 101P3A11-related Genes and Their Products

Oncogenesis is known to be a multistep process where cellular growth becomes progressively dysregulated
and cells progress from a normal physiological state to precancerous and then cancerous states (see, e.g., Alers et
al., Lab Invest. 77(5): 437-438 (1997) and Isaacs et al., Cancer Surv. 23: 19-32 (1995)). In this context, examining
a biological sample for evidence of dysregulated cell growth (such as aberrant 101P3A11 expression in cancers)
allows for early detection of such aberrant physiology, before a pathologic state such as cancer has progressed to a
stage at which therapeutic options are more limited and/or the prognosis is worse. In such examinations, the status of
101P3A11 in a biological sample of interest can be compared, for example, to the status of 101P3A11 in a
corresponding normal sample (e.g. a sample from that individual or alternatively another individual that is not
affected by a pathology). An alteration in the status of 101P3A11 in the biological sample (as compared to the
normal sample) provides evidence of dysregulated cellular growth. In addition to using a biological sample that is
not affected by a pathology as a normal sample, one can also use a predetermined normative value such as a
predetermined normal level of mRNA expression (see, e.g., Grever et al., J. Comp. Neurol. 1996 Dec 9;376(2):306-
14 and U.S. Patent No. 5,837,501) to compare 101P3A11 status in a sample.
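
The comparison described above, of a test sample against either a matched normal sample or a predetermined
normative value, can be sketched in Python as a simple ratio. The function names, the detection floor and the
two-fold threshold are hypothetical choices made for this illustration; the text does not prescribe any particular
cutoff or unit of measurement.

```python
def expression_fold_change(test_level, reference_level, detection_floor=1.0):
    """Ratio of test to reference expression.

    reference_level may come from a matched normal sample or from a predetermined
    normative value. It is clamped to detection_floor (an assumed assay limit) so
    that a reference with no detectable expression does not cause division by zero.
    """
    if test_level < 0 or reference_level < 0:
        raise ValueError("expression levels must be non-negative")
    return test_level / max(reference_level, detection_floor)


def status_is_altered(test_level, reference_level, threshold=2.0, detection_floor=1.0):
    """Flag the sample when expression is elevated by at least `threshold`-fold (assumed cutoff)."""
    return expression_fold_change(test_level, reference_level, detection_floor) >= threshold


if __name__ == "__main__":
    # Arbitrary units, for demonstration only.
    print(expression_fold_change(8.0, 2.0), status_is_altered(8.0, 2.0))
```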

The term "status" in this context is used according to its art accepted meaning and refers to the condition or
state of a gene and its products. Typically, skilled artisans use a number of parameters to evaluate the condition or state
of a gene and its products. These include, but are not limited to, the location of expressed gene products (including the
location of 101P3A11 expressing cells) as well as the level and biological activity of expressed gene products (such as
101P3A11 mRNA, polynucleotides and polypeptides). Typically, an alteration in the status of 101P3A11 comprises
a change in the location of 101P3A11 and/or 101P3A11 expressing cells and/or an increase in 101P3A11 mRNA
and/or protein expression.

101P3A11 status in a sample can be analyzed by a number of means well known in the art, including without
limitation, immunohistochemical analysis, in situ hybridization, RT-PCR analysis on laser capture micro-dissected
samples, Western blot analysis, and tissue array analysis. Typical protocols for evaluating the status of the 101P3A11
gene and gene products are found, for example in Ausubel et al. eds., 1995, Current Protocols In Molecular Biology,
Units 2 (Northern Blotting), 4 (Southern Blotting), 15 (Immunoblotting) and 18 (PCR Analysis). Thus, the status of
101P3A11 in a biological sample is evaluated by various methods utilized by skilled artisans including, but not
limited to, genomic Southern analysis (to examine, for example, perturbations in the 101P3A11 gene), Northern
analysis and/or PCR analysis of 101P3A11 mRNA (to examine, for example, alterations in the polynucleotide
sequences or expression levels of 101P3A11 mRNAs), and Western and/or immunohistochemical analysis (to
examine, for example, alterations in polypeptide sequences, alterations in polypeptide localization within a sample,
alterations in expression levels of 101P3A11 proteins and/or associations of 101P3A11 proteins with polypeptide
binding partners). Detectable 101P3A11 polynucleotides include, for example, a 101P3A11 gene or fragment thereof,
101P3A11 mRNA, alternative splice variant 101P3A11 mRNAs, and recombinant DNA or RNA molecules
containing a 101P3A11 polynucleotide.

The expression profile of 101P3A11 makes it a diagnostic marker for local and/or metastasized disease,
and provides information on the growth or oncogenic potential of a biological sample. In particular, the status of
101P3A11 provides information useful for predicting susceptibility to particular disease stages, progression, and/or
tumor aggressiveness. The invention provides methods and assays for determining 101P3A11 status and diagnosing
cancers that express 101P3A11, such as cancers of the tissues listed in Table I. For example, because 101P3A11
mRNA is so highly expressed in prostate and other cancers relative to normal prostate tissue, assays that evaluate the
levels of 101P3A11 mRNA transcripts or proteins in a biological sample can be used to diagnose a disease associated
with 101P3A11 dysregulation, and can provide prognostic information useful in defining appropriate therapeutic
options.

The expression status of 101P3A11 provides information including the presence, stage and location of
dysplastic, precancerous and cancerous cells, and is useful for predicting susceptibility to various stages of disease
and/or for gauging tumor aggressiveness. Moreover, the expression profile makes it useful as an imaging reagent for
metastasized disease. Consequently, an aspect of the invention is directed to the various molecular prognostic and
diagnostic methods for examining the status of 101P3A11 in biological samples such as those from individuals
suffering from, or suspected of suffering from, a pathology characterized by dysregulated cellular growth, such as cancer.

As described above, the status of 101P3A11 in a biological sample can be examined by a number of well-
known procedures in the art. For example, the status of 101P3A11 in a biological sample taken from a specific
location in the body can be examined by evaluating the sample for the presence or absence of 101P3A11 expressing
cells (e.g. those that express 101P3A11 mRNAs or proteins). This examination can provide evidence of
dysregulated cellular growth, for example, when 101P3A11-expressing cells are found in a biological sample that
does not normally contain such cells (such as a lymph node), because such alterations in the status of 101P3A11 in a
biological sample are often associated with dysregulated cellular growth. Specifically, one indicator of
dysregulated cellular growth is the metastasis of cancer cells from an organ of origin (such as the prostate) to a
different area of the body (such as a lymph node). In this context, evidence of dysregulated cellular growth is
important, for example, because occult lymph node metastases can be detected in a substantial proportion of patients
with prostate cancer, and such metastases are associated with known predictors of disease progression (see, e.g.,
Murphy et al., Prostate 42(4): 315-317 (2000); Su et al., Semin. Surg. Oncol. 18(1): 17-28 (2000) and Freeman et al.,
J Urol 1995 Aug 154(2 Pt 1):474-8).

In one aspect, the invention provides methods for monitoring 101P3A11 gene products by determining the
status of 101P3A11 gene products expressed by cells from an individual suspected of having a disease associated
with dysregulated cell growth (such as hyperplasia or cancer) and then comparing the status so determined to the
status of 101P3A11 gene products in a corresponding normal sample. The presence of aberrant 101P3A11 gene
products in the test sample relative to the normal sample provides an indication of the presence of dysregulated cell
growth within the cells of the individual.

In another aspect, the invention provides assays useful in determining the presence of cancer in an
individual, comprising detecting a significant increase in 101P3A11 mRNA or protein expression in a test cell or
tissue sample relative to expression levels in the corresponding normal cell or tissue. The presence of 101P3A11
mRNA can, for example, be evaluated in tissue samples including but not limited to those listed in Table I. The
presence of significant 101P3A11 expression in any of these tissues is useful to indicate the emergence, presence
and/or severity of a cancer, since the corresponding normal tissues do not express 101P3A11 mRNA or express it at
lower levels.

In a related embodiment, 101P3A11 status is determined at the protein level rather than at the nucleic acid
level. For example, such a method comprises determining the level of 101P3A11 protein expressed by cells in a test
tissue sample and comparing the level so determined to the level of 101P3A11 expressed in a corresponding normal
sample. In one embodiment, the presence of 101P3A11 protein is evaluated, for example, using
immunohistochemical methods. 101P3A11 antibodies or binding partners capable of detecting 101P3A11 protein
expression are used in a variety of assay formats well known in the art for this purpose.

In a further embodiment, one can evaluate the status of 101P3A11 nucleotide and amino acid sequences in a
biological sample in order to identify perturbations in the structure of these molecules. These perturbations can include
insertions, deletions, substitutions and the like. Such evaluations are useful because perturbations in the nucleotide and
amino acid sequences are observed in a large number of proteins associated with a growth dysregulated phenotype (see,
e.g., Marrogi et al., 1999, J. Cutan. Pathol. 26(8):369-378). For example, a mutation in the sequence of 101P3A11
may be indicative of the presence or promotion of a tumor. Such assays therefore have diagnostic and predictive value
where a mutation in 101P3A11 indicates a potential loss of function or increase in tumor growth.

A wide variety of assays for observing perturbations in nucleotide and amino acid sequences are well known in
the art. For example, the size and structure of nucleic acid or amino acid sequences of 101P3A11 gene products are
observed by the Northern, Southern, Western, PCR and DNA sequencing protocols discussed herein. In addition, other
methods for observing perturbations in nucleotide and amino acid sequences such as single strand conformation
polymorphism analysis are well known in the art (see, e.g., U.S. Patent Nos. 5,382,510 issued 7 September 1999, and
5,952,170 issued 17 January 1995).

Additionally, one can examine the methylation status of the 101P3A11 gene in a biological sample. Aberrant
demethylation and/or hypermethylation of CpG islands in gene 5' regulatory regions frequently occurs in immortalized
and transformed cells, and can result in altered expression of various genes. For example, promoter hypermethylation
of the pi-class glutathione S-transferase (a protein expressed in normal prostate but not expressed in >90% of
prostate carcinomas) appears to permanently silence transcription of this gene and is the most frequently detected
genomic alteration in prostate carcinomas (De Marzo et al., Am. J. Pathol. 155(6): 1985-1992 (1999)). In addition,
this alteration is present in at least 70% of cases of high-grade prostatic intraepithelial neoplasia (PIN) (Brooks et al.,
Cancer Epidemiol. Biomarkers Prev., 1998, 7:531-536). In another example, expression of the LAGE-I tumor
specific gene (which is not expressed in normal prostate but is expressed in 25-50% of prostate cancers) is induced
by deoxy-azacytidine in lymphoblastoid cells, suggesting that tumoral expression is due to demethylation (Lethe et
al., Int. J. Cancer 76(6): 903-908 (1998)). A variety of assays for examining methylation status of a gene are well
known in the art. For example, one can utilize, in Southern hybridization approaches, methylation-sensitive restriction
enzymes which cannot cleave sequences that contain methylated CpG sites to assess the methylation status of CpG
islands. In addition, MSP (methylation specific PCR) can rapidly profile the methylation status of all the CpG sites
present in a CpG island of a given gene. This procedure involves initial modification of DNA by sodium bisulfite
(which will convert all unmethylated cytosines to uracil) followed by amplification using primers specific for
methylated versus unmethylated DNA. Protocols involving methylation interference can also be found for example in
Current Protocols In Molecular Biology, Unit 12, Frederick M. Ausubel et al. eds., 1995.
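
The bisulfite modification step that underlies MSP can be illustrated with a short Python sketch: every
unmethylated cytosine is rewritten as uracil, while cytosines marked as methylated are left unchanged, which is why
primers can distinguish the two templates. The function names and the toy sequence are assumptions for this example,
and methylated positions are supplied by the caller as 0-based indices. In a real reaction the uracil is copied as
thymine during amplification; the sketch keeps "U" to mirror the wording above.

```python
def cpg_sites(seq):
    """Return 0-based indices of cytosines that are followed by guanine (CpG sites)."""
    s = seq.upper()
    return [i for i in range(len(s) - 1) if s[i] == "C" and s[i + 1] == "G"]


def bisulfite_convert(seq, methylated_positions=frozenset()):
    """Simulate bisulfite treatment of one DNA strand.

    Unmethylated C is converted to U; C at an index listed in
    methylated_positions is protected and stays C.
    """
    converted = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated_positions:
            converted.append("U")
        else:
            converted.append(base)
    return "".join(converted)


if __name__ == "__main__":
    toy = "ACGTCG"  # toy sequence, for demonstration only
    print(cpg_sites(toy))
    print(bisulfite_convert(toy))        # fully unmethylated template
    print(bisulfite_convert(toy, {1}))   # CpG at index 1 methylated
```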

Gene amplification is an additional method for assessing the status of 101P3A11. Gene amplification is
measured in a sample directly, for example, by conventional Southern blotting or Northern blotting to quantitate the
transcription of mRNA (Thomas, 1980, Proc. Natl. Acad. Sci. USA, 77:5201-5205), dot blotting (DNA analysis), or
in situ hybridization, using an appropriately labeled probe, based on the sequences provided herein. Alternatively,
antibodies are employed that recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA
hybrid duplexes or DNA-protein duplexes. The antibodies in turn are labeled and the assay carried out where the
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to
the duplex can be detected.

Biopsied tissue or peripheral blood can be conveniently assayed for the presence of cancer cells using, for
example, Northern, dot blot or RT-PCR analysis to detect 101P3A11 expression. The presence of RT-PCR amplifiable
101P3A11 mRNA provides an indication of the presence of cancer. RT-PCR assays are well known in the art. RT-PCR
detection assays for tumor cells in peripheral blood are currently being evaluated for use in the diagnosis and
management of a number of human solid tumors. In the prostate cancer field, these include RT-PCR assays for the
detection of cells expressing PSA and PSM (Verkaik et al., 1997, Urol. Res. 25:373-384; Ghossein et al., 1995, J. Clin.
Oncol. 13:1195-2000; Heston et al., 1995, Clin. Chem. 41:1687-1688).

A further aspect of the invention is an assessment of the susceptibility that an individual has for developing
cancer. In one embodiment, a method for predicting susceptibility to cancer comprises detecting 101P3A11 mRNA or
101P3A11 protein in a tissue sample, its presence indicating susceptibility to cancer, wherein the degree of 101P3A11
mRNA expression correlates to the degree of susceptibility. In a specific embodiment, the presence of 101P3A11 in
prostate or other tissue is examined, with the presence of 101P3A11 in the sample providing an indication of prostate
cancer susceptibility (or the emergence or existence of a prostate tumor). Similarly, one can evaluate the integrity of
101P3A11 nucleotide and amino acid sequences in a biological sample, in order to identify perturbations in the structure
of these molecules such as insertions, deletions, substitutions and the like. The presence of one or more perturbations in
101P3A11 gene products in the sample is an indication of cancer susceptibility (or the emergence or existence of a
tumor).

The invention also comprises methods for gauging tumor aggressiveness. In one embodiment, a method for
gauging aggressiveness of a tumor comprises determining the level of 101P3A11 mRNA or 101P3A11 protein
expressed by tumor cells, comparing the level so determined to the level of 101P3A11 mRNA or 101P3A11 protein
expressed in a corresponding normal tissue taken from the same individual or a normal tissue reference sample, wherein
the degree of 101P3A11 mRNA or 101P3A11 protein expression in the tumor sample relative to the normal sample
indicates the degree of aggressiveness. In a specific embodiment, aggressiveness of a tumor is evaluated by determining
the extent to which 101P3A11 is expressed in the tumor cells, with higher expression levels indicating more aggressive
tumors. Another embodiment is the evaluation of the integrity of 101P3A11 nucleotide and amino acid sequences in a
biological sample, in order to identify perturbations in the structure of these molecules such as insertions, deletions,
substitutions and the like. The presence of one or more perturbations indicates more aggressive tumors.

Another embodiment of the invention is directed to methods for observing the progression of a malignancy in
an individual over time. In one embodiment, methods for observing the progression of a malignancy in an individual
over time comprise determining the level of 101P3A11 mRNA or 101P3A11 protein expressed by cells in a sample of
the tumor, comparing the level so determined to the level of 101P3A11 mRNA or 101P3A11 protein expressed in an
equivalent tissue sample taken from the same individual at a different time, wherein the degree of 101P3A11 mRNA or
101P3A11 protein expression in the tumor sample over time provides information on the progression of the cancer. In a
specific embodiment, the progression of a cancer is evaluated by determining 101P3A11 expression in the tumor cells
over time, where increased expression over time indicates a progression of the cancer. Also, one can evaluate the
integrity of 101P3A11 nucleotide and amino acid sequences in a biological sample in order to identify perturbations in the
structure of these molecules such as insertions, deletions, substitutions and the like, where the presence of one or more
perturbations indicates a progression of the cancer.
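
As a rough illustration of the over-time comparison, the sketch below classifies the change between the earliest and latest measurement from the same individual. The (day, level) representation and the three labels are assumptions made for the example; the specification states only that increased expression over time indicates progression.

```python
def expression_trend(levels_over_time: list[tuple[int, float]]) -> str:
    """Classify expression change between the earliest and latest time point.

    `levels_over_time` holds (day, level) pairs for equivalent tissue samples
    taken from the same individual; the pairs may arrive in any order.
    """
    if len(levels_over_time) < 2:
        raise ValueError("at least two time points are needed")
    ordered = sorted(levels_over_time)  # sort by day
    first_level = ordered[0][1]
    last_level = ordered[-1][1]
    if last_level > first_level:
        return "increased"  # read as progression, per the text above
    if last_level < first_level:
        return "decreased"
    return "unchanged"
```

In practice a tolerance for assay variability would be needed before calling a change real; that refinement is omitted here.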

The above diagnostic approaches can be combined with any one of a wide variety of prognostic and diagnostic
protocols known in the art. For example, another embodiment of the invention is directed to methods for observing a
coincidence between the expression of 101P3A11 gene and 101P3A11 gene products (or perturbations in 101P3A11
gene and 101P3A11 gene products) and a factor that is associated with malignancy, as a means for diagnosing and
prognosticating the status of a tissue sample. A wide variety of factors associated with malignancy can be utilized, such
as the expression of genes associated with malignancy (e.g. PSA, PSCA and PSM expression for prostate cancer etc.) as
well as gross cytological observations (see, e.g., Bocking et al., 1984, Anal. Quant. Cytol. 6(2):74-88; Epstein, 1995,
Hum. Pathol. 26(2):223-9; Thorson et al., 1998, Mod. Pathol. 11(6):543-51; Baisden et al., 1999, Am. J. Surg.
Pathol. 23(8):918-24). Methods for observing a coincidence between the expression of 101P3A11 gene and 101P3A11
gene products (or perturbations in 101P3A11 gene and 101P3A11 gene products) and another factor that is associated
with malignancy are useful, for example, because the presence of a set of specific factors that coincide with disease
provides information crucial for diagnosing and prognosticating the status of a tissue sample.

In one embodiment, methods for observing a coincidence between the expression of 101P3A11 gene and
101P3A11 gene products (or perturbations in 101P3A11 gene and 101P3A11 gene products) and another factor
associated with malignancy entail detecting the overexpression of 101P3A11 mRNA or protein in a tissue sample,
detecting the overexpression of PSA mRNA or protein in a tissue sample (or PSCA or PSM expression), and observing
a coincidence of 101P3A11 mRNA or protein and PSA mRNA or protein overexpression (or PSCA or PSM
expression). In a specific embodiment, the expression of 101P3A11 and PSA mRNA in prostate tissue is examined,
where the coincidence of 101P3A11 and PSA mRNA overexpression in the sample indicates the existence of prostate
cancer, prostate cancer susceptibility or the emergence or status of a prostate tumor.
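
The coincidence test in this embodiment reduces to checking that two markers are each overexpressed in the same sample. The sketch below is one hedged way to express that logic; the `MarkerReading` type and the two-fold default cutoff are hypothetical choices for illustration, since the specification does not define a numerical threshold for overexpression.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MarkerReading:
    """Expression of one marker in a sample and in a normal reference."""
    sample_level: float
    reference_level: float

    def overexpressed(self, min_fold: float = 2.0) -> bool:
        # Assumption: "overexpression" means at least `min_fold` times
        # the reference level; the 2.0 default is illustrative only.
        return self.sample_level >= min_fold * self.reference_level


def coincident_overexpression(
    first: MarkerReading, second: MarkerReading, min_fold: float = 2.0
) -> bool:
    """True only when both markers are overexpressed in the same sample."""
    return first.overexpressed(min_fold) and second.overexpressed(min_fold)
```

For example, one reading could hold 101P3A11 mRNA levels and the other PSA mRNA levels from the same prostate tissue sample.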

Methods for detecting and quantifying the expression of 101P3A11 mRNA or protein are described herein, and
standard nucleic acid and protein detection and quantification technologies are well known in the art. Standard methods
for the detection and quantification of 101P3A11 mRNA include in situ hybridization using labeled 101P3A11
riboprobes, Northern blot and related techniques using 101P3A11 polynucleotide probes, RT-PCR analysis using
primers specific for 101P3A11, and other amplification type detection methods, such as, for example, branched DNA,
SISBA, TMA and the like. In a specific embodiment, semi-quantitative RT-PCR is used to detect and quantify
101P3A11 mRNA expression. Any number of primers capable of amplifying 101P3A11 can be used for this purpose,
including but not limited to the various primer sets specifically described herein. In a specific embodiment, polyclonal
or monoclonal antibodies specifically reactive with the wild-type 101P3A11 protein can be used in an
immunohistochemical assay of biopsied tissue.

IX.) Identification of Molecules that Interact with 101P3A11

The 101P3A11 protein and nucleic acid sequences disclosed herein allow a skilled artisan to identify
proteins, small molecules and other agents that interact with 101P3A11, as well as pathways activated by
101P3A11 via any one of a variety of art accepted protocols. For example, one can utilize one of the so-called
interaction trap systems (also referred to as the "two-hybrid assay"). In such systems, molecules interact and
reconstitute a transcription factor which directs expression of a reporter gene, whereupon the expression of the
reporter gene is assayed. Other systems identify protein-protein interactions in vivo through reconstitution of a
eukaryotic transcriptional activator, see, e.g., U.S. Patent Nos. 5,955,280 issued 21 September 1999, 5,925,523
issued 20 July 1999, 5,846,722 issued 8 December 1998 and 6,004,746 issued 21 December 1999. Algorithms are
also available in the art for genome-based predictions of protein function (see, e.g., Marcotte, et al., Nature 402: 4
November 1999, 83-86).

Alternatively, one can screen peptide libraries to identify molecules that interact with 101P3A11 protein
sequences. In such methods, peptides that bind to 101P3A11 are identified by screening libraries that encode a
random or controlled collection of amino acids. Peptides encoded by the libraries are expressed as fusion proteins
of bacteriophage coat proteins, and the bacteriophage particles are then screened against the 101P3A11 protein.

Accordingly, peptides having a wide variety of uses, such as therapeutic, prognostic or diagnostic reagents,
are thus identified without any prior information on the structure of the expected ligand or receptor molecule.
Typical peptide libraries and screening methods that can be used to identify molecules that interact with 101P3A11
protein sequences are disclosed for example in U.S. Patent Nos. 5,723,286 issued 3 March 1998 and 5,733,731
issued 31 March 1998.

Alternatively, cell lines that express 101P3A11 are used to identify protein-protein interactions mediated
by 101P3A11. Such interactions can be examined using immunoprecipitation techniques (see, e.g., Hamilton B.J.,
et al. Biochem. Biophys. Res. Commun. 1999, 261:646-51). 101P3A11 protein can be immunoprecipitated from
101P3A11-expressing cell lines using anti-101P3A11 antibodies. Alternatively, antibodies against His-tag can be
used in a cell line engineered to express fusions of 101P3A11 and a His-tag (vectors mentioned above). The
immunoprecipitated complex can be examined for protein association by procedures such as Western blotting,
35S-methionine labeling of proteins, protein microsequencing, silver staining and two-dimensional gel electrophoresis.

Small molecules and ligands that interact with 101P3A11 can be identified through related embodiments
of such screening assays. For example, small molecules can be identified that interfere with protein function,
including molecules that interfere with 101P3A11's ability to mediate phosphorylation and de-phosphorylation,
interaction with DNA or RNA molecules as an indication of regulation of cell cycles, second messenger signaling
or tumorigenesis. Similarly, small molecules that modulate 101P3A11-related ion channel, protein pump, or cell
communication functions are identified and used to treat patients that have a cancer that expresses 101P3A11 (see,
e.g., Hille, B., Ionic Channels of Excitable Membranes 2nd Ed., Sinauer Assoc., Sunderland, MA, 1992). Moreover,
ligands that regulate 101P3A11 function can be identified based on their ability to bind 101P3A11 and activate a
reporter construct. Typical methods are discussed for example in U.S. Patent No. 5,928,868 issued 27 July 1999,
and include methods for forming hybrid ligands in which at least one ligand is a small molecule. In an illustrative
embodiment, cells engineered to express a fusion protein of 101P3A11 and a DNA-binding protein are used to
co-express a fusion protein of a hybrid ligand/small molecule and a cDNA library transcriptional activator protein.
The cells further contain a reporter gene, the expression of which is conditioned on the proximity of the first and
second fusion proteins to each other, an event that occurs only if the hybrid ligand binds to target sites on both
hybrid proteins. Those cells that express the reporter gene are selected and the unknown small molecule or the
unknown ligand is identified. This method provides a means of identifying modulators which activate or inhibit
101P3A11.

An embodiment of this invention comprises a method of screening for a molecule that interacts with a
101P3A11 amino acid sequence shown in Figure 2 or Figure 3, comprising the steps of contacting a population of
molecules with the 101P3A11 amino acid sequence, allowing the population of molecules and the 101P3A11 amino
acid sequence to interact under conditions that facilitate an interaction, determining the presence of a molecule that
interacts with the 101P3A11 amino acid sequence, and then separating molecules that do not interact with the
101P3A11 amino acid sequence from molecules that do. In a specific embodiment, the method further comprises
purifying, characterizing and identifying a molecule that interacts with the 101P3A11 amino acid sequence. The
identified molecule can be used to modulate a function performed by 101P3A11. In a preferred embodiment, the
101P3A11 amino acid sequence is contacted with a library of peptides.

Non-limiting examples of small molecules include compounds that bind members of the olfactory receptor
family of GPCRs, ligands including hormones, neuropeptides, chemokines, odorants, phospholipids, and functional
equivalents thereof that bind and preferably inhibit 101P3A11 protein function. Such non-limiting small molecules
preferably have a molecular weight of less than about 10 kDa, more preferably below about 9, about 8, about 7,
about 6, about 5 or about 4 kDa; physically associate with, or bind, 101P3A11 protein; are not found in naturally
occurring metabolic pathways; and/or are more soluble in aqueous than non-aqueous solutions.
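
The tiered molecular weight preference above can be read as a descending list of ceilings. The short sketch below, offered only as an illustration, reports the tightest listed ceiling a candidate falls under. It treats each "about" value as an exact cutoff, which is a simplifying assumption, and the names are hypothetical.

```python
# Ceilings in kDa as listed in the text: less than about 10, more
# preferably below about 9, 8, 7, 6, 5 or 4.
PREFERRED_CEILINGS_KDA = (10, 9, 8, 7, 6, 5, 4)


def tightest_ceiling_kda(molecular_weight_kda: float):
    """Return the smallest listed ceiling the weight falls under, or None.

    Assumption: "about N kDa" is treated as a strict cutoff at N.
    """
    met = [c for c in PREFERRED_CEILINGS_KDA if molecular_weight_kda < c]
    return min(met) if met else None
```

Molecular weight is only one of the listed criteria; binding, natural occurrence and solubility would have to be assessed separately.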

X.) Therapeutic Methods and Compositions

The identification of 101P3A11 as a protein that is normally expressed in a restricted set of tissues, but
which is also expressed in prostate and other cancers, opens a number of therapeutic approaches to the treatment of
such cancers. As contemplated herein, 101P3A11 functions as a transcription factor involved in activating
tumor-promoting genes or repressing genes that block tumorigenesis.

Accordingly, therapeutic approaches that inhibit the activity of the 101P3A11 protein are useful for
patients suffering from a cancer that expresses 101P3A11. These therapeutic approaches generally fall into two
classes. One class comprises various methods for inhibiting the binding or association of the 101P3A11 protein
with its binding partner or with other proteins. Another class comprises a variety of methods for inhibiting the
transcription of the 101P3A11 gene or translation of 101P3A11 mRNA.

X.A.) Anti-Cancer Vaccines

The invention provides cancer vaccines comprising a 101P3A11-related protein or 101P3A11-related nucleic
acid. In view of the expression of 101P3A11, cancer vaccines prevent and/or treat 101P3A11-expressing cancers with
minimal or no effects on non-target tissues. The use of a tumor antigen in a vaccine that generates humoral and/or
cell-mediated immune responses as anti-cancer therapy is well known in the art and has been employed in prostate cancer
using human PSMA and rodent PAP immunogens (Hodge et al., 1995, Int. J. Cancer 63:231-237; Fong et al., 1997, J.
Immunol. 159:3113-3117).

Such methods can be readily practiced by employing a 101P3A11-related protein, or a 101P3A11-encoding
nucleic acid molecule and recombinant vectors capable of expressing and presenting the 101P3A11
immunogen (which typically comprises a number of antibody or T cell epitopes). Skilled artisans understand that a
wide variety of vaccine systems for delivery of immunoreactive epitopes are known in the art (see, e.g., Herlyn et
al., Ann Med 1999 Feb 31(1):66-78; Maruyama et al., Cancer Immunol Immunother 2000 Jun 49(3):123-32).

Briefly, such methods of generating an immune response (e.g. humoral and/or cell-mediated) in a mammal
comprise the steps of: exposing the mammal's immune system to an immunoreactive epitope (e.g. an epitope
present in the 101P3A11 protein shown in SEQ ID NO: 703 or analog or homolog thereof) so that the mammal
generates an immune response that is specific for that epitope (e.g. generates antibodies that specifically recognize
that epitope). In a preferred method, the 101P3A11 immunogen contains a biological motif, see e.g., Tables
V-XVIII or XXVI-XXVIII, or a peptide of a size range from 101P3A11 indicated in Figure 5, Figure 6, Figure 7,
Figure 8, and Figure 9.

The entire 101P3A11 protein, immunogenic regions or epitopes thereof can be combined and delivered by
various means. Such vaccine compositions can include, for example, lipopeptides (e.g., Vitiello, A. et al., J. Clin.
Invest. 95:341, 1995), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres
(see, e.g., Eldridge, et al., Molec. Immunol. 28:287-294, 1991; Alonso et al., Vaccine 12:299-306, 1994; Jones et
al., Vaccine 13:675-681, 1995), peptide compositions contained in immune stimulating complexes (ISCOMS) (see,
e.g., Takahashi et al., Nature 344:873-875, 1990; Hu et al., Clin Exp Immunol. 113:235-243, 1998), multiple
antigen peptide systems (MAPs) (see e.g., Tam, J. P., Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413, 1988; Tam, J.P.,
J. Immunol. Methods 196:17-32, 1996), peptides formulated as multivalent peptides; peptides for use in ballistic
delivery systems, typically crystallized peptides, viral delivery vectors (Perkus, M. E. et al., In: Concepts in vaccine
development, Kaufmann, S. H. E., ed., p. 379, 1996; Chakrabarti, S. et al., Nature 320:535, 1986; Hu, S. L. et al.,
Nature 320:537, 1986; Kieny, M.-P. et al., AIDS Bio/Technology 4:790, 1986; Top, F. H. et al., J. Infect. Dis.
124:148, 1971; Chanda, P. K. et al., Virology 175:535, 1990), particles of viral or synthetic origin (e.g., Kofler, N.
et al., J. Immunol. Methods. 192:25, 1996; Eldridge, J. H. et al., Sem. Hematol. 30:16, 1993; Falo, L. D., Jr. et al.,
Nature Med. 7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol. 4:369,
1986; Gupta, R. K. et al., Vaccine 11:293, 1993), liposomes (Reddy, R. et al., J. Immunol. 148:1585, 1992; Rock,
K. L., Immunol. Today 17:131, 1996), or, naked or particle absorbed cDNA (Ulmer, J. B. et al., Science 259:1745,
1993; Robinson, H. L., Hunt, L. A., and Webster, R. G., Vaccine 11:957, 1993; Shiver, J. W. et al., In: Concepts in
vaccine development, Kaufmann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofsky, J. A., Annu. Rev.
Immunol. 12:923, 1994 and Eldridge, J. H. et al., Sem. Hematol. 30:16, 1993). Toxin-targeted delivery
technologies, also known as receptor mediated targeting, such as those of Avant Immunotherapeutics, Inc.
(Needham, Massachusetts) may also be used.

In patients with 101P3A11-associated cancer, the vaccine compositions of the invention can also be used
in conjunction with other treatments used for cancer, e.g., surgery, chemotherapy, drug therapies, radiation
therapies, etc. including use in combination with immune adjuvants such as IL-2, IL-12, GM-CSF, and the like.

Cellular Vaccines:

CTL epitopes can be determined using specific algorithms to identify peptides within 101P3A11 protein that
bind corresponding HLA alleles (see e.g., Table IV; Epimer™ and Epimatrix™, Brown University (URL
www.brown.edu/Research/TB-HW_I^b/epta and, BIMAS, (URL bimas.dcrt.nih.gov/;
SYFPEITHI at URL syfpeithi.bmi-heidelberg.com/). In a preferred embodiment, the 101P3A11 immunogen
contains one or more amino acid sequences identified using techniques well known in the art, such as the sequences
shown in Tables V-XVIII or XXVI-XXVIII or a peptide of 8, 9, 10 or 11 amino acids specified by an HLA Class I
motif/supermotif (e.g., Table IV (A), Table IV (D), or Table IV (E)) and/or a peptide of at least 9 amino acids that
comprises an HLA Class II motif/supermotif (e.g., Table IV (B) or Table IV (C)). As is appreciated in the art, the
HLA Class I binding groove is essentially closed ended so that peptides of only a particular size range can fit into
the groove and be bound; generally, HLA Class I epitopes are 8, 9, 10, or 11 amino acids long. In contrast, the HLA
Class II binding groove is essentially open ended; therefore a peptide of about 9 or more amino acids can be bound
by an HLA Class II molecule. Due to the binding groove differences between HLA Class I and II, HLA Class I
motifs are length specific, i.e., position two of a Class I motif is the second amino acid in an amino to carboxyl
direction of the peptide. The amino acid positions in a Class II motif are relative only to each other, not the overall
peptide, i.e., additional amino acids can be attached to the amino and/or carboxyl termini of a motif-bearing
sequence. HLA Class II epitopes are often 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino
acids long, or longer than 25 amino acids.
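
Because Class I motifs are length specific, candidate Class I peptides can be enumerated as every 8-, 9-, 10- and 11-residue window of the protein and then checked at fixed positions. The sketch below illustrates that idea only. The function names are hypothetical, the residue sets passed by the caller are placeholders rather than values from Table IV, and checking position two together with the carboxyl terminus is an assumption about which positions a given motif constrains. It is not a substitute for the prediction algorithms cited above.

```python
from typing import Iterator


def candidate_class_i_peptides(
    protein: str, lengths: tuple[int, ...] = (8, 9, 10, 11)
) -> Iterator[tuple[int, str]]:
    """Yield (1-based start position, peptide) for every window of each length."""
    for length in lengths:
        # range() is empty when the protein is shorter than the window.
        for start in range(len(protein) - length + 1):
            yield start + 1, protein[start:start + length]


def matches_class_i_motif(
    peptide: str, position_two: set[str], c_terminal: set[str]
) -> bool:
    """Check the second residue (counted from the amino terminus) and the
    carboxyl-terminal residue against caller-supplied placeholder sets."""
    return peptide[1] in position_two and peptide[-1] in c_terminal
```

A Class II search would differ: since Class II motif positions are relative only to each other, the motif would be located anywhere within a longer peptide rather than at fixed offsets from the termini.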
Antibody-based Vaccines

A wide variety of methods for generating an immune response in a mammal are known in the art (for
example as the first step in the generation of hybridomas). Methods of generating an immune response in a
mammal comprise exposing the mammal's immune system to an immunogenic epitope on a protein (e.g. the
101P3A11 protein) so that an immune response is generated. In one embodiment the protein can be an
anti-idiotypic antibody. A typical embodiment consists of a method for generating an immune response to 101P3A11 in
a host, by contacting the host with a sufficient amount of at least one 101P3A11 B cell or cytotoxic T-cell epitope
or analog thereof; and at least one periodic interval thereafter re-contacting the host with the 101P3A11 B cell or
cytotoxic T-cell epitope or analog thereof. A specific embodiment consists of a method of generating an immune
response against a 101P3A11-related protein or a man-made multiepitopic peptide comprising: administering
101P3A11 immunogen (e.g. the 101P3A11 protein or a peptide fragment thereof, a 101P3A11 fusion protein or
analog etc.) in a vaccine preparation to a human or another mammal. Typically, such vaccine preparations further
contain a suitable adjuvant (see, e.g., U.S. Patent No. 6,146,635) or a universal helper epitope such as a PADRE™
peptide (Epimmune Inc., San Diego, CA; see, e.g., Alexander et al., J. Immunol. 2000 164(3): 1625-1633;
Alexander et al., Immunity 1994 1(9): 751-761 and Alexander et al., Immunol. Res. 1998 18(2): 79-92). An
alternative method comprises generating an immune response in an individual against a 101P3A11 immunogen by:
administering in vivo to muscle or skin of the individual's body a DNA molecule that comprises a DNA sequence
that encodes a 101P3A11 immunogen, the DNA sequence operatively linked to regulatory sequences which
control the expression of the DNA sequence; wherein the DNA molecule is taken up by cells, the DNA sequence is
expressed in the cells and an immune response is generated against the immunogen (see, e.g., U.S. Patent No.
5,962,428). Optionally a genetic vaccine facilitator such as anionic lipids; saponins; lectins; estrogenic compounds;
hydroxylated lower alkyls; dimethyl sulfoxide; and urea is also administered.
Nucleic Acid Vaccines:

Vaccine compositions of the invention include nucleic acid-mediated modalities. DNA or RNA that
encode protein(s) of the invention can be administered to a patient. Genetic immunization methods can be
employed to generate prophylactic or therapeutic humoral and cellular immune responses directed against cancer
cells expressing 101P3A11. Constructs comprising DNA encoding a 101P3A11-related protein/immunogen and
appropriate regulatory sequences can be injected directly into muscle or skin of an individual, such that the cells of
the muscle or skin take up the construct and express the encoded 101P3A11 protein/immunogen. Alternatively, a
vaccine comprises a 101P3A11-related protein. Expression of the 101P3A11-related protein immunogen results in
the generation of prophylactic or therapeutic humoral and cellular immunity against cells that bear 101P3A11
protein. Various prophylactic and therapeutic genetic immunization techniques known in the art can be used (for
review, see information and references published at Internet address www.genweb.com). Nucleic acid-based
delivery is described, for instance, in Wolff et al., Science 247:1465 (1990) as well as U.S. Patent Nos. 5,580,859;
5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 98/04720. Examples of DNA-based delivery
technologies include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery, cationic lipid
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, proteins of the invention can be expressed via viral
or bacterial vectors. Various viral gene delivery systems that can be used in the practice of the invention include, but
are not limited to, vaccinia, fowlpox, canarypox, adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus,
and sindbis virus (see, e.g., Restifo, 1996, Curr. Opin. Immunol. 8:658-663; Tsang et al. J. Natl. Cancer Inst. 87:982-990
(1995)). Non-viral delivery systems can also be employed by introducing naked DNA encoding a 101P3A11-related
protein into the patient (e.g., intramuscularly or intradermally) to induce an anti-tumor response.

Vaccinia virus is used, for example, as a vector to express nucleotide sequences that encode the peptides of
the invention. Upon introduction into a host, the recombinant vaccinia virus expresses the protein immunogenic
peptide, and thereby elicits a host immune response. Vaccinia vectors and methods useful in immunization
protocols are described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin).
BCG vectors are described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful for
therapeutic administration or immunization of the peptides of the invention, e.g. adeno and adeno-associated virus
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, will be apparent
to those skilled in the art from the description herein.

Thus, gene delivery systems are used to deliver a 101P3A11-related nucleic acid molecule. In one
embodiment, the full-length human 101P3A11 cDNA is employed. In another embodiment, 101P3A11 nucleic acid
molecules encoding specific cytotoxic T lymphocyte (CTL) and/or antibody epitopes are employed.
Ex Vivo Vaccines

Various ex vivo strategies can also be employed to generate an immune response. One approach involves the
use of antigen presenting cells (APCs) such as dendritic cells (DC) to present 101P3A11 antigen to a patient's immune
system. Dendritic cells express MHC class I and II molecules, B7 co-stimulator, and IL-12, and are thus highly
specialized antigen presenting cells. In prostate cancer, autologous dendritic cells pulsed with peptides of the
prostate-specific membrane antigen (PSMA) are being used in a Phase I clinical trial to stimulate prostate cancer
patients' immune systems (Tjoa et al., 1996, Prostate 28:65-69; Murphy et al., 1996, Prostate 29:371-380). Thus,
dendritic cells can be used to present 101P3A11 peptides to T cells in the context of MHC class I or II molecules.
In one embodiment, autologous dendritic cells are pulsed with 101P3A11 peptides capable of binding to MHC class
I and/or class II molecules. In another embodiment, dendritic cells are pulsed with the complete 101P3A11 protein.
Yet another embodiment involves engineering the overexpression of the 101P3A11 gene in dendritic cells using
various implementing vectors known in the art, such as adenovirus (Arthur et al., 1997, Cancer Gene Ther.
4:17-25), retrovirus (Henderson et al., 1996, Cancer Res. 56:3763-3770), lentivirus, adeno-associated virus, DNA
transfection (Ribas et al., 1997, Cancer Res. 57:2865-2869), or tumor-derived RNA transfection (Ashley et al.,
1997, J. Exp. Med. 186:1177-1182). Cells that express 101P3A11 can also be engineered to express immune
modulators, such as GM-CSF, and used as immunizing agents.


X.B.) 101P3A11 as a Target for Antibody-based Therapy 

101P3A11 is an attractive target for antibody-based therapeutic strategies. A number of antibody
strategies are known in the art for targeting both extracellular and intracellular molecules (e.g., complement and
ADCC mediated killing, the use of intrabodies, and conjugated antibodies). Because 101P3A11 is expressed by cancer
cells of various lineages relative to corresponding normal cells, 101P3A11-immunoreactive compositions for systemic
administration are prepared that exhibit excellent sensitivity without toxic, non-specific and/or non-target effects
caused by binding of the immunoreactive composition to non-target organs and tissues. Antibodies specifically
reactive with domains of 101P3A11 are useful to treat 101P3A11-expressing cancers systemically, either as
conjugates with a toxin or therapeutic agent, or as naked antibodies capable of inhibiting cell proliferation or
function.

101P3A11 antibodies can be introduced into a patient such that the antibody binds to 101P3A11 and
modulates a function, such as an interaction with a binding partner, and consequently mediates destruction of the
tumor cells and/or inhibits the growth of the tumor cells. Mechanisms by which such antibodies exert a therapeutic
effect can include complement-mediated cytolysis, antibody-dependent cellular cytotoxicity, modulation of the
physiological function of 101P3A11, inhibition of ligand binding or signal transduction pathways, modulation of
tumor cell differentiation, alteration of tumor angiogenesis factor profiles, and/or apoptosis.

Those skilled in the art understand that antibodies can be used to specifically target and bind immunogenic
molecules such as an immunogenic region of the 101P3A11 sequence shown in Figure 2 or Figure 3. In addition,
skilled artisans understand that it is routine to conjugate antibodies to cytotoxic agents (see, e.g., Sievers et al.
Blood 93:11 3678-3684 (June 1, 1999)). When cytotoxic and/or therapeutic agents are delivered directly to cells,
such as by conjugating them to antibodies specific for a molecule expressed by that cell (e.g. 101P3A11), the
cytotoxic agent will exert its known biological effect (i.e. cytotoxicity) on those cells.

A wide variety of compositions and methods for using antibody-cytotoxic agent conjugates to kill cells are
known in the art. In the context of cancers, typical methods entail administering to an animal having a tumor a
biologically effective amount of a conjugate comprising a selected cytotoxic and/or therapeutic agent linked to a
targeting agent (e.g. an anti-101P3A11 antibody) that binds to a marker (e.g. 101P3A11) expressed, accessible to
binding or localized on the cell surfaces. A typical embodiment is a method of delivering a cytotoxic and/or
therapeutic agent to a cell expressing 101P3A11, comprising conjugating the cytotoxic agent to an antibody that
immunospecifically binds to a 101P3A11 epitope, and exposing the cell to the antibody-agent conjugate. Another
illustrative embodiment is a method of treating an individual suspected of suffering from metastasized cancer,
comprising a step of administering parenterally to said individual a pharmaceutical composition comprising a
therapeutically effective amount of an antibody conjugated to a cytotoxic and/or therapeutic agent.

Cancer immunotherapy using anti-101P3A11 antibodies can be done in accordance with various
approaches that have been successfully employed in the treatment of other types of cancer, including but not limited
to colon cancer (Arlen et al., 1998, Crit. Rev. Immunol. 18:133-138), multiple myeloma (Ozaki et al., 1997, Blood
90:3179-3186, Tsunenari et al., 1997, Blood 90:2437-2444), gastric cancer (Kasprzyk et al., 1992, Cancer Res.
52:2771-2776), B-cell lymphoma (Funakoshi et al., 1996, J. Immunother. Emphasis Tumor Immunol. 19:93-101),
leukemia (Zhong et al., 1996, Leuk. Res. 20:581-589), colorectal cancer (Moun et al., 1994, Cancer Res.
54:6160-6166; Velders et al., 1995, Cancer Res. 55:4398-4403), and breast cancer (Shepard et al., 1991, J. Clin. Immunol.
11:117-127). Some therapeutic approaches involve conjugation of naked antibody to a toxin or radioisotope, such
as the conjugation of Y91 or I131 to anti-CD20 antibodies (e.g., Zevalin™, IDEC Pharmaceuticals Corp. or Bexxar™,
Coulter Pharmaceuticals), while others involve co-administration of antibodies and other therapeutic agents, such as
Herceptin™ (trastuzumab) with paclitaxel (Genentech, Inc.). To treat prostate cancer, for example, 101P3A11
antibodies can be administered in conjunction with radiation, chemotherapy or hormone ablation.

Although 101P3A11 antibody therapy is useful for all stages of cancer, antibody therapy can be
particularly appropriate in advanced or metastatic cancers. Treatment with the antibody therapy of the invention is
indicated for patients who have received one or more rounds of chemotherapy. Alternatively, antibody therapy of
the invention is combined with a chemotherapeutic or radiation regimen for patients who have not received
chemotherapeutic treatment. Additionally, antibody therapy can enable the use of reduced dosages of concomitant
chemotherapy, particularly for patients who do not tolerate the toxicity of the chemotherapeutic agent very well.
Fan et al. (Cancer Res. 53:4637-4642, 1993), Prewett et al. (International J. of Onco. 9:217-224, 1996), and
Hancock et al. (Cancer Res. 51:4575-4580, 1991) describe the use of various antibodies and chemotherapeutic
agents in vitro.

Cancer patients can be evaluated for the presence and level of 101P3A11 expression, preferably using
immunohistochemical assessments of tumor tissue, quantitative 101P3A11 imaging, or other techniques that
reliably indicate the presence and degree of 101P3A11 expression. Immunohistochemical analysis of tumor
biopsies or surgical specimens is preferred for this purpose. Methods for immunohistochemical analysis of tumor
tissues are well known in the art.

Anti-101P3A11 monoclonal antibodies that treat prostate and other cancers include those that initiate a
potent immune response against the tumor or those that are directly cytotoxic. In this regard, anti-101P3A11
monoclonal antibodies (mAbs) can elicit tumor cell lysis by either complement-mediated or antibody-dependent
cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc portion of the immunoglobulin molecule
for interaction with effector cell Fc receptor sites on complement proteins. In addition, anti-101P3A11 mAbs that
exert a direct biological effect on tumor growth are useful to treat cancers that express 101P3A11. Mechanisms by
which directly cytotoxic mAbs act include: inhibition of cell growth, modulation of cellular differentiation,
modulation of tumor angiogenesis factor profiles, and the induction of apoptosis. The mechanism(s) by which a
particular anti-101P3A11 mAb exerts an anti-tumor effect is evaluated using any number of in vitro assays that
evaluate cell death such as ADCC, ADMMC, complement-mediated cell lysis, and so forth, as is generally known
in the art.

In some patients, the use of murine or other non-human monoclonal antibodies, or human/mouse chimeric
mAbs can induce moderate to strong immune responses against the non-human antibody. This can result in
clearance of the antibody from circulation and reduced efficacy. In the most severe cases, such an immune
response can lead to the extensive formation of immune complexes which, potentially, can cause renal failure.
Accordingly, preferred monoclonal antibodies used in the therapeutic methods of the invention are those that are
either fully human or humanized and that bind specifically to the target 101P3A11 antigen with high affinity but
exhibit low or no antigenicity in the patient.

Therapeutic methods of the invention contemplate the administration of single anti-101P3A11 mAbs as
well as combinations, or cocktails, of different mAbs. Such mAb cocktails can have certain advantages inasmuch
as they contain mAbs that target different epitopes, exploit different effector mechanisms or combine directly
cytotoxic mAbs with mAbs that rely on immune effector functionality. Such mAbs in combination can exhibit
synergistic therapeutic effects. In addition, anti-101P3A11 mAbs can be administered concomitantly with other
therapeutic modalities, including but not limited to various chemotherapeutic agents, androgen-blockers, immune
modulators (e.g., IL-2, GM-CSF), surgery or radiation. The anti-101P3A11 mAbs are administered in their "naked"
or unconjugated form, or can have a therapeutic agent(s) conjugated to them.

Anti-101P3A11 antibody formulations are administered via any route capable of delivering the antibodies
to a tumor cell. Routes of administration include, but are not limited to, intravenous, intraperitoneal, intramuscular,
intratumor, intradermal, and the like. Treatment generally involves repeated administration of the anti-101P3A11
antibody preparation, via an acceptable route of administration such as intravenous injection (IV), typically at a
dose in the range of about 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or 25 mg/kg
body weight. In general, doses in the range of 10-1000 mg mAb per week are effective and well tolerated.

Based on clinical experience with the Herceptin mAb in the treatment of metastatic breast cancer, an initial
loading dose of approximately 4 mg/kg patient body weight IV, followed by weekly doses of about 2 mg/kg IV of
the anti-101P3A11 mAb preparation represents an acceptable dosing regimen. Preferably, the initial loading dose
is administered as a 90 minute or longer infusion. The periodic maintenance dose is administered as a 30 minute or
longer infusion, provided the initial dose was well tolerated. As appreciated by those of skill in the art, various
factors can influence the ideal dose regimen in a particular case. Such factors include, for example, the binding
affinity and half-life of the Abs or mAbs used, the degree of 101P3A11 expression in the patient, the extent of
circulating shed 101P3A11 antigen, the desired steady-state antibody concentration level, frequency of treatment,
and the influence of chemotherapeutic or other agents used in combination with the treatment method of the
invention, as well as the health status of a particular patient.
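The weight-based arithmetic of the regimen above can be sketched as follows. This is only an illustration of the calculation, assuming the approximate 4 mg/kg loading dose and 2 mg/kg weekly maintenance dose stated in the text; the function name, its parameters, and the 70 kg example patient are hypothetical and are not part of the described method.

```python
def weekly_doses_mg(body_weight_kg, weeks, loading_mg_per_kg=4.0, maintenance_mg_per_kg=2.0):
    """Return the dose (in mg) for each week: one loading dose, then maintenance doses.

    The default mg/kg values are the approximate figures quoted in the text;
    everything else here is an illustrative assumption.
    """
    if body_weight_kg <= 0 or weeks < 1:
        raise ValueError("body weight must be positive and weeks must be at least 1")
    loading = loading_mg_per_kg * body_weight_kg
    maintenance = maintenance_mg_per_kg * body_weight_kg
    return [loading] + [maintenance] * (weeks - 1)


# Hypothetical 70 kg patient over four weeks.
schedule = weekly_doses_mg(70.0, 4)
print(schedule)  # [280.0, 140.0, 140.0, 140.0]
```

For this hypothetical patient both the loading dose and the maintenance dose fall inside the 10-1000 mg per week range mentioned above.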

Optionally, patients should be evaluated for the levels of 101P3A11 in a given sample (e.g., the levels of
circulating 101P3A11 antigen and/or 101P3A11 expressing cells) in order to assist in the determination of the most
effective dosing regimen, etc. Such evaluations are also used for monitoring purposes throughout therapy, and are
useful to gauge therapeutic success in combination with the evaluation of other parameters (for example, urine
cytology and/or ImmunoCyt levels in bladder cancer therapy, or by analogy, serum PSA levels in prostate cancer
therapy).

Anti-idiotypic anti-101P3A11 antibodies can also be used in anti-cancer therapy as a vaccine for inducing
an immune response to cells expressing a 101P3A11-related protein. In particular, the generation of anti-idiotypic
antibodies is well known in the art; this methodology can readily be adapted to generate anti-idiotypic
anti-101P3A11 antibodies that mimic an epitope on a 101P3A11-related protein (see, for example, Wagner et al., 1997,
Hybridoma 16: 33-40; Foon et al., 1995, J. Clin. Invest. 96:334-342; Herlyn et al., 1996, Cancer Immunol.
Immunother. 43:65-76). Such an anti-idiotypic antibody can be used in cancer vaccine strategies.
X.C.) 101P3A11 as a Target for Cellular Immune Responses

Vaccines and methods of preparing vaccines that contain an immunogenically effective amount of one or
more HLA-binding peptides as described herein are further embodiments of the invention. Furthermore, vaccines
in accordance with the invention encompass compositions of one or more of the claimed peptides. A peptide can be
present in a vaccine individually. Alternatively, the peptide can exist as a homopolymer comprising multiple copies
of the same peptide, or as a heteropolymer of various peptides. Polymers have the advantage of increased
immunological reaction and, where different peptide epitopes are used to make up the polymer, the additional
ability to induce antibodies and/or CTLs that react with different antigenic determinants of the pathogenic organism
or tumor-related peptide targeted for an immune response. The composition can be a naturally occurring region of
an antigen or can be prepared, e.g., recombinantly or by chemical synthesis.

Carriers that can be used with vaccines of the invention are well known in the art, and include, e.g.,
thyroglobulin, albumins such as human serum albumin, tetanus toxoid, polyamino acids such as poly L-lysine, poly
L-glutamic acid, influenza, hepatitis B virus core protein, and the like. The vaccines can contain a physiologically
tolerable (i.e., acceptable) diluent such as water, or saline, preferably phosphate buffered saline. The vaccines also
typically include an adjuvant. Adjuvants such as incomplete Freund's adjuvant, aluminum phosphate, aluminum
hydroxide, or alum are examples of materials well known in the art. Additionally, as disclosed herein, CTL
responses can be primed by conjugating peptides of the invention to lipids, such as
tripalmitoyl-S-glycerylcysteinlyseryl-serine (P3CSS). Moreover, an adjuvant such as synthetic
cytosine-phosphorothiolated-guanine-containing (CpG) oligonucleotides has been found to increase CTL responses
10- to 100-fold (see, e.g., Davila and Celis, J. Immunol. 165:539-547 (2000)).

Upon immunization with a peptide composition in accordance with the invention, via injection, aerosol,
oral, transdermal, transmucosal, intrapleural, intrathecal, or other suitable routes, the immune system of the host
responds to the vaccine by producing large amounts of CTLs and/or HTLs specific for the desired antigen.
Consequently, the host becomes at least partially immune to later development of cells that express or overexpress
101P3A11 antigen, or derives at least some therapeutic benefit when the antigen is tumor-associated.

In some embodiments, it may be desirable to combine the class I peptide components with components
that induce or facilitate neutralizing antibody and/or helper T cell responses directed to the target antigen. A
preferred embodiment of such a composition comprises class I and class II epitopes in accordance with the
invention. An alternative embodiment of such a composition comprises a class I and/or class II epitope in
accordance with the invention, along with a cross reactive HTL epitope such as a PADRE™ (Epimmune, San Diego,
CA) molecule (described, e.g., in U.S. Patent Number 5,736,142).

A vaccine of the invention can also include antigen-presenting cells (APC), such as dendritic cells (DC), as
a vehicle to present peptides of the invention. Vaccine compositions can be created in vitro, following dendritic cell
mobilization and harvesting, whereby loading of dendritic cells occurs in vitro. For example, dendritic cells are
transfected, e.g., with a minigene in accordance with the invention, or are pulsed with peptides. The dendritic cell
can then be administered to a patient to elicit immune responses in vivo. Vaccine compositions, either DNA- or
peptide-based, can also be administered in vivo in combination with dendritic cell mobilization whereby loading of
dendritic cells occurs in vivo.

Preferably, the following principles are utilized when selecting an array of epitopes for inclusion in a 
polyepitopic composition for use in a vaccine, or for selecting discrete epitopes to be included in a vaccine and/or to
be encoded by nucleic acids such as a minigene. It is preferred that each of the following principles be balanced in 
order to make the selection. The multiple epitopes to be incorporated in a given vaccine composition may be, but 
need not be, contiguous in sequence in the native antigen from which the epitopes are derived. 

1.) Epitopes are selected which, upon administration, mimic immune responses that have been
observed to be correlated with tumor clearance. For HLA Class I this includes 3-4 epitopes that come from at least
one tumor associated antigen (TAA). For HLA Class II a similar rationale is employed; again 3-4 epitopes are
selected from at least one TAA (see, e.g., Rosenberg et al., Science 278:1447-1450). Epitopes from one TAA may
be used in combination with epitopes from one or more additional TAAs to produce a vaccine that targets tumors
with varying expression patterns of frequently-expressed TAAs.

2.) Epitopes are selected that have the requisite binding affinity established to be correlated with
immunogenicity: for HLA Class I an IC50 of 500 nM or less, often 200 nM or less; and for Class II an IC50 of 1000
nM or less.

3.) Sufficient supermotif-bearing peptides, or a sufficient array of allele-specific motif-bearing
peptides, are selected to give broad population coverage. For example, it is preferable to have at least 80%
population coverage. A Monte Carlo analysis, a statistical evaluation known in the art, can be employed to assess
the breadth, or redundancy of, population coverage.

4.) When selecting epitopes from cancer-related antigens it is often useful to select analogs because
the patient may have developed tolerance to the native epitope. 

5.) Of particular relevance are epitopes referred to as "nested epitopes." Nested epitopes occur where
at least two epitopes overlap in a given peptide sequence. A nested peptide sequence can comprise B cell, HLA
class I and/or HLA class II epitopes. When providing nested epitopes, a general objective is to provide the greatest
number of epitopes per sequence. Thus, an aspect is to avoid providing a peptide that is any longer than the amino
terminus of the amino terminal epitope and the carboxyl terminus of the carboxyl terminal epitope in the peptide.
When providing a multi-epitopic sequence, such as a sequence comprising nested epitopes, it is generally important
to screen the sequence in order to ensure that it does not have pathological or other deleterious biological properties.

6.) If a polyepitopic protein is created, or when creating a minigene, an objective is to generate the
smallest peptide that encompasses the epitopes of interest. This principle is similar, if not the same, as that
employed when selecting a peptide comprising nested epitopes. However, with an artificial polyepitopic peptide,
the size minimization objective is balanced against the need to integrate any spacer sequences between epitopes in
the polyepitopic protein. Spacer amino acid residues can, for example, be introduced to avoid junctional epitopes
(an epitope recognized by the immune system, not present in the target antigen, and only created by the man-made
juxtaposition of epitopes), or to facilitate cleavage between epitopes and thereby enhance epitope presentation.
Junctional epitopes are generally to be avoided because the recipient may generate an immune response to that
non-native epitope. Of particular concern is a junctional epitope that is a "dominant epitope." A dominant epitope may
lead to such a zealous response that immune responses to other epitopes are diminished or suppressed.

7.) Where the sequences of multiple variants of the same target protein are present, potential peptide
epitopes can also be selected on the basis of their conservancy. For example, a criterion for conservancy may
define that the entire sequence of an HLA class I binding peptide or the entire 9-mer core of a class II binding
peptide be conserved in a designated percentage of the sequences evaluated for a specific protein antigen.
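Two of the principles above lend themselves to a simple computational screen: the binding-affinity cutoffs of principle 2 and the conservancy criterion of principle 7. The following is a minimal sketch under stated assumptions. The IC50 limits (500 nM for class I, 1000 nM for class II) are taken from the text; the 0.8 conservancy threshold, the function names, and all peptide and variant sequences are hypothetical placeholders, not data from this disclosure.

```python
CLASS_I_MAX_IC50_NM = 500.0    # principle 2, HLA class I
CLASS_II_MAX_IC50_NM = 1000.0  # principle 2, HLA class II


def passes_affinity(hla_class, ic50_nm):
    """True if the measured IC50 meets the cutoff for the given HLA class."""
    if hla_class == "I":
        return ic50_nm <= CLASS_I_MAX_IC50_NM
    if hla_class == "II":
        return ic50_nm <= CLASS_II_MAX_IC50_NM
    raise ValueError("hla_class must be 'I' or 'II'")


def conservancy(peptide, variant_sequences):
    """Fraction of variant protein sequences that contain the peptide exactly."""
    if not variant_sequences:
        return 0.0
    hits = sum(1 for seq in variant_sequences if peptide in seq)
    return hits / len(variant_sequences)


def select_epitopes(candidates, variant_sequences, min_conservancy=0.8):
    """Keep candidates that pass both the affinity and the conservancy screens.

    candidates: iterable of (peptide, hla_class, ic50_nm) tuples.
    min_conservancy: assumed stand-in for the "designated percentage".
    """
    return [
        peptide
        for peptide, hla_class, ic50_nm in candidates
        if passes_affinity(hla_class, ic50_nm)
        and conservancy(peptide, variant_sequences) >= min_conservancy
    ]


# Hypothetical variant sequences and candidate peptides, for illustration only.
variants = [
    "AAKLVFFAEDVGSNKGAII",
    "AAKLVFFAEDVGSNKGAII",
    "AAKLVYFAEDVGSNKGAII",
]
candidates = [
    ("KLVFFAEDV", "I", 120.0),       # good affinity, absent from one variant
    ("EDVGSNKGA", "I", 300.0),       # good affinity, fully conserved
    ("VGSNKGAII", "I", 900.0),       # conserved, but IC50 above the class I cutoff
    ("FAEDVGSNKGAII", "II", 800.0),  # class II, within the 1000 nM cutoff
]
print(select_epitopes(candidates, variants))
```

The remaining principles (population coverage, analogs, nested and junctional epitopes) are to be balanced against these screens and are not captured by this sketch.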
X.C.1. Minigene Vaccines

A number of different approaches are available which allow simultaneous delivery of multiple epitopes. 
Nucleic acids encoding the peptides of the invention are a particularly useful embodiment of the invention. 
Epitopes for inclusion in a minigene are preferably selected according to the guidelines set forth in the previous 
section. A preferred means of administering nucleic acids encoding the peptides of the invention uses minigene
constructs encoding a peptide comprising one or multiple epitopes of the invention.

The use of multi-epitope minigenes is described below and in Ishioka et al., J. Immunol. 162:3915-3925,
1999; An, L. and Whitton, J. L., J. Virol. 71:2292, 1997; Thomson, S. A. et al., J. Immunol. 157:822, 1996;
Whitton, J. L. et al., J. Virol. 67:348, 1993; Hanke, R. et al., Vaccine 16:426, 1998. For example, a multi-epitope
DNA plasmid encoding supermotif- and/or motif-bearing epitopes derived from 101P3A11, the PADRE® universal
helper T cell epitope (or multiple HTL epitopes from 101P3A11), and an endoplasmic reticulum-translocating
signal sequence can be engineered. A vaccine may also comprise epitopes that are derived from other TAAs.

The immunogenicity of a multi-epitopic minigene can be confirmed in transgenic mice to evaluate the
magnitude of CTL induction responses against the epitopes tested. Further, the immunogenicity of DNA-encoded
epitopes in vivo can be correlated with the in vitro responses of specific CTL lines against target cells transfected
with the DNA plasmid. Thus, these experiments can show both that: 1.) the minigene serves to generate a CTL
response, and 2.) the induced CTLs recognize cells expressing the encoded epitopes.

For example, to create a DNA sequence encoding the selected epitopes (minigene) for expression in human
cells, the amino acid sequences of the epitopes may be reverse translated. A human codon usage table can be used
to guide the codon choice for each amino acid. These epitope-encoding DNA sequences may be directly adjoined,
so that when translated, a continuous polypeptide sequence is created. To optimize expression and/or
immunogenicity, additional elements can be incorporated into the minigene design. Examples of amino acid
sequences that can be reverse translated and included in the minigene sequence include: HLA class I epitopes, HLA
class II epitopes, antibody epitopes, a ubiquitination signal sequence, and/or an endoplasmic reticulum targeting
signal. In addition, HLA presentation of CTL and HTL epitopes may be improved by including synthetic (e.g.,
poly-alanine) or naturally-occurring flanking sequences adjacent to the CTL or HTL epitopes; these larger peptides
comprising the epitope(s) are within the scope of the invention.
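The reverse-translation step described above can be sketched as follows. This is a minimal illustration, not the construct design of the invention. The codon table holds one commonly used human codon per amino acid and is an assumption (a current human codon usage table should be consulted); the added ATG start codon, the TGA stop codon, the three-alanine spacer, and all function names are likewise illustrative choices. The two input peptides are the tetanus toxoid and Streptococcus 18kD helper sequences quoted elsewhere in this document, used here only as example inputs.

```python
# Assumed codon choices: one frequently used human codon per amino acid.
PREFERRED_CODON = {
    "A": "GCC", "R": "AGA", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}
# Inverse map, used only to check the round trip.
CODON_TO_AA = {codon: aa for aa, codon in PREFERRED_CODON.items()}


def reverse_translate(peptide):
    """Map each amino acid (one-letter code) to its assumed preferred codon."""
    return "".join(PREFERRED_CODON[aa] for aa in peptide)


def build_minigene(epitopes, spacer="AAA"):
    """Join epitopes with a spacer peptide, reverse translate, add start and stop codons."""
    polypeptide = spacer.join(epitopes)
    return "ATG" + reverse_translate(polypeptide) + "TGA"


def translate(dna):
    """Translate DNA made only of the preferred codons back to amino acids."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


epitopes = ["QYIKANSKFIGITE", "GAVDSILGGVATYGAA"]
minigene = build_minigene(epitopes)
print(len(minigene))             # 105
print(translate(minigene[3:-3]))  # QYIKANSKFIGITEAAAGAVDSILGGVATYGAA
```

Passing `spacer=""` adjoins the epitope-encoding sequences directly, as in the text; signal sequences or other elements would be reverse translated and prepended in the same way.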

The minigene sequence may be converted to DNA by assembling oligonucleotides that encode the plus
and minus strands of the minigene. Overlapping oligonucleotides (30-100 bases long) may be synthesized,
phosphorylated, purified and annealed under appropriate conditions using well known techniques. The ends of the
oligonucleotides can be joined, for example, using T4 DNA ligase. This synthetic minigene, encoding the epitope
polypeptide, can then be cloned into a desired expression vector.
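One way to lay out overlapping plus- and minus-strand oligonucleotides is sketched below. It is an illustrative tiling scheme, assumed for this example rather than taken from the text: the plus strand is cut into consecutive oligonucleotides, and the minus-strand oligonucleotides are offset by half an oligonucleotide length so that each one bridges the junction between two plus-strand pieces. Only the 30-100 base length range comes from the text; the function names and the test sequence are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def tile_oligos(plus_strand, oligo_len=60):
    """Split a sequence into plus-strand oligos and half-offset minus-strand oligos.

    Each minus-strand oligo is complementary to the second half of one
    plus-strand oligo and the first half of the next, so annealing holds
    neighbouring pieces together before ligation.
    """
    if not 30 <= oligo_len <= 100 or oligo_len % 2:
        raise ValueError("oligo_len must be an even number between 30 and 100")
    half = oligo_len // 2
    plus = [plus_strand[i:i + oligo_len]
            for i in range(0, len(plus_strand), oligo_len)]
    minus = [reverse_complement(plus_strand[i:i + oligo_len])
             for i in range(half, len(plus_strand), oligo_len)]
    return plus, minus


# Hypothetical 150-base sequence, for illustration only.
sequence = "ACGTTGCAAG" * 15
plus_oligos, minus_oligos = tile_oligos(sequence, oligo_len=60)
print(len(plus_oligos), len(minus_oligos))  # 3 2
```

In this layout the first half-length of the plus strand is left single-stranded; how the ends are filled in or matched to the cloning site of the vector is a separate design decision.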

Standard regulatory sequences well known to those of skill in the art are preferably included in the vector
to ensure expression in the target cells. Several vector elements are desirable: a promoter with a downstream
cloning site for minigene insertion; a polyadenylation signal for efficient transcription termination; an E. coli origin
of replication; and an E. coli selectable marker (e.g., ampicillin or kanamycin resistance). Numerous promoters can
be used for this purpose, e.g., the human cytomegalovirus (hCMV) promoter. See, e.g., U.S. Patent Nos. 5,580,859
and 5,589,466 for other suitable promoter sequences.

Additional vector modifications may be desired to optimize minigene expression and immunogenicity. In 
some cases, introns are required for efficient gene expression, and one or more synthetic or naturally-occurring 
introns could be incorporated into the transcribed region of the minigene. The inclusion of mRNA stabilization
sequences and sequences for replication in mammalian cells may also be considered for increasing minigene 
expression. 

Once an expression vector is selected, the minigene is cloned into the polylinker region downstream of the 
promoter. This plasmid is transformed into an appropriate E. coli strain, and DNA is prepared using standard
techniques. The orientation and DNA sequence of the minigene, as well as all other elements included in the
vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial cells harboring the correct
plasmid can be stored as a master cell bank and a working cell bank. 

In addition, immunostimulatory sequences (ISSs or CpGs) appear to play a role in the immunogenicity of 
DNA vaccines. These sequences may be included in the vector, outside the minigene coding sequence, if desired to
enhance immunogenicity.

In some embodiments, a bi-cistronic expression vector which allows production of both the
minigene-encoded epitopes and a second protein (included to enhance or decrease immunogenicity) can be used. Examples
of proteins or polypeptides that could beneficially enhance the immune response if co-expressed include cytokines
(e.g., IL-2, IL-12, GM-CSF), cytokine-inducing molecules (e.g., LeIF), costimulatory molecules, or for HTL
responses, pan-DR binding proteins (PADRE™, Epimmune, San Diego, CA). Helper (HTL) epitopes can be joined
to intracellular targeting signals and expressed separately from expressed CTL epitopes; this allows direction of the
HTL epitopes to a cell compartment different than that of the CTL epitopes. If required, this could facilitate more
efficient entry of HTL epitopes into the HLA class II pathway, thereby improving HTL induction. In contrast to
HTL or CTL induction, specifically decreasing the immune response by co-expression of immunosuppressive
molecules (e.g., TGF-β) may be beneficial in certain diseases.

Therapeutic quantities of plasmid DNA can be produced, for example, by fermentation in E. coli, followed
by purification. Aliquots from the working cell bank are used to inoculate growth medium, and grown to saturation
in shaker flasks or a bioreactor according to well-known techniques. Plasmid DNA can be purified using standard
bioseparation technologies such as solid phase anion-exchange resins supplied by QIAGEN, Inc. (Valencia,
California). If required, supercoiled DNA can be isolated from the open circular and linear forms using gel
electrophoresis or other methods.

Purified plasmid DNA can be prepared for injection using a variety of formulations. The simplest of these
is reconstitution of lyophilized DNA in sterile phosphate-buffered saline (PBS). This approach, known as "naked
DNA," is currently being used for intramuscular (IM) administration in clinical trials. To maximize the
immunotherapeutic effects of minigene DNA vaccines, an alternative method for formulating purified plasmid
DNA may be desirable. A variety of methods have been described, and new techniques may become available.
Cationic lipids, glycolipids, and fusogenic liposomes can also be used in the formulation (see, e.g., as described by
WO 93/24640; Mannino & Gould-Fogerite, BioTechniques 6(7): 682 (1988); U.S. Pat No. 5,279,833; WO
91/06309; and Felgner et al., Proc. Nat'l Acad. Sci. USA 84:7413 (1987)). In addition, peptides and compounds
referred to collectively as protective, interactive, non-condensing compounds (PINC) could also be complexed to
purified plasmid DNA to influence variables such as stability, intramuscular dispersion, or trafficking to specific
organs or cell types.

Target cell sensitization can be used as a functional assay for expression and HLA class I presentation of
minigene-encoded CTL epitopes. For example, the plasmid DNA is introduced into a mammalian cell line that is
suitable as a target for standard CTL chromium release assays. The transfection method used will be dependent on
the final formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow direct in vitro
transfection. A plasmid expressing green fluorescent protein (GFP) can be co-transfected to allow enrichment of
transfected cells using fluorescence activated cell sorting (FACS). These cells are then chromium-51 (51Cr) labeled
and used as target cells for epitope-specific CTL lines; cytolysis, detected by 51Cr release, indicates both production
of, and HLA presentation of, minigene-encoded CTL epitopes. Expression of HTL epitopes may be evaluated in an
analogous manner using assays to assess HTL activity.
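The text does not give a formula for reading out the chromium release assay. A calculation commonly used for such assays is percent specific lysis, which compares the 51Cr released in the presence of CTLs with the spontaneous release (targets alone) and the maximum release (targets fully lysed, for example with detergent). The sketch below assumes that convention; the function name and the counts are hypothetical.

```python
def percent_specific_lysis(experimental_cpm, spontaneous_cpm, maximum_cpm):
    """Percent specific lysis from 51Cr release counts (counts per minute).

    Uses the conventional formula:
        100 * (experimental - spontaneous) / (maximum - spontaneous)
    """
    window = maximum_cpm - spontaneous_cpm
    if window <= 0:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental_cpm - spontaneous_cpm) / window


# Hypothetical counts, for illustration only.
print(percent_specific_lysis(1500, 500, 2500))  # 50.0
```

Lysis of minigene-transfected targets that clearly exceeds the lysis of control targets would be read as evidence that the encoded epitopes are produced and presented.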

In vivo immunogenicity is a second approach for functional testing of minigene DNA formulations.
Transgenic mice expressing appropriate human HLA proteins are immunized with the DNA product. The dose and
route of administration are formulation dependent (e.g., IM for DNA in PBS, intraperitoneal (i.p.) for
lipid-complexed DNA). Twenty-one days after immunization, splenocytes are harvested and restimulated for one week
in the presence of peptides encoding each epitope being tested. Thereafter, for CTL effector cells, assays are
conducted for cytolysis of peptide-loaded, 51Cr-labeled target cells using standard techniques. Lysis of target cells
that were sensitized by HLA loaded with peptide epitopes, corresponding to minigene-encoded epitopes,
demonstrates DNA vaccine function for in vivo induction of CTLs. Immunogenicity of HTL epitopes is confirmed
in transgenic mice in an analogous manner.

Alternatively, the nucleic acids can be administered using ballistic delivery as described, for instance, in 
U.S. Patent No. 5,204,253. Using this technique, particles comprised solely of DNA are administered. In a further 
alternative embodiment, DNA can be adhered to particles, such as gold particles. 

Minigenes can also be delivered using other bacterial or viral delivery systems well known in the art, e.g., 
an expression construct encoding epitopes of the invention can be incorporated into a viral vector such as vaccinia.
X.C.2. Combinations of CTL Peptides with Helper Peptides 
Vaccine compositions comprising CTL peptides of the invention can be modified, e.g., analoged, to 
provide desired attributes, such as improved serum half life, broadened population coverage or enhanced 
immunogenicity. 

For instance, the ability of a peptide to induce CTL activity can be enhanced by linking the peptide to a
sequence which contains at least one epitope that is capable of inducing a T helper cell response. Although a CTL
peptide can be directly linked to a T helper peptide, often CTL epitope/HTL epitope conjugates are linked by a
spacer molecule. The spacer is typically comprised of relatively small, neutral molecules, such as amino acids or
amino acid mimetics, which are substantially uncharged under physiological conditions. The spacers are typically
selected from, e.g., Ala, Gly, or other neutral spacers of nonpolar amino acids or neutral polar amino acids. It will
be understood that the optionally present spacer need not be comprised of the same residues and thus may be a
hetero- or homo-oligomer. When present, the spacer will usually be at least one or two residues, more usually three
to six residues and sometimes 10 or more residues. The CTL peptide epitope can be linked to the T helper peptide
epitope either directly or via a spacer either at the amino or carboxy terminus of the CTL peptide. The amino
terminus of either the immunogenic peptide or the T helper peptide may be acylated.

In certain embodiments, the T helper peptide is one that is recognized by T helper cells present in a
majority of a genetically diverse population. This can be accomplished by selecting peptides that bind to many,
most, or all of the HLA class II molecules. Examples of such amino acid sequences that bind many HLA Class II
molecules include sequences from antigens such as tetanus toxoid at positions 830-843 (QYIKANSKFIGITE; SEQ ID NO:
2876), Plasmodium falciparum circumsporozoite (CS) protein at positions 378-398
(DIEKKIAKMEKASSVFNVVNS; SEQ ID NO: 2877), and Streptococcus 18kD protein at positions 116-131
(GAVDSILGGVATYGAA; SEQ ID NO: 2878). Other examples include peptides bearing a DR 1-4-7 supermotif,
or either of the DR3 motifs.

Alternatively, it is possible to prepare synthetic peptides capable of stimulating T helper lymphocytes, in a
loosely HLA-restricted fashion, using amino acid sequences not found in nature (see, e.g., PCT publication WO
95/07707). These synthetic compounds called Pan-DR-binding epitopes (e.g., PADRE™, Epimmune, Inc., San
Diego, CA) are designed to most preferably bind most HLA-DR (human HLA class II) molecules. For instance, a
pan-DR-binding epitope peptide having the formula: aKXVAAWTLKAAa (SEQ ID NO: 2879), where "X" is
either cyclohexylalanine, phenylalanine, or tyrosine, and "a" is either D-alanine or L-alanine, has been found to bind to
most HLA-DR alleles, and to stimulate the response of T helper lymphocytes from most individuals, regardless of
their HLA type. An alternative of a pan-DR binding epitope comprises all "L" natural amino acids and can be
provided in the form of nucleic acids that encode the epitope.

HTL peptide epitopes can also be modified to alter their biological properties. For example, they can be
modified to include D-amino acids to increase their resistance to proteases and thus extend their serum half-life, or
they can be conjugated to other molecules such as lipids, proteins, carbohydrates, and the like to increase their
biological activity. For example, a T helper peptide can be conjugated to one or more palmitic acid chains at either
the amino or carboxyl termini.

X.C.3. Combinations of CTL Peptides with T Cell Priming Agents 
In some embodiments it may be desirable to include in the pharmaceutical compositions of the invention at
least one component which primes B lymphocytes or T lymphocytes. Lipids have been identified as agents capable
of priming CTL in vivo. For example, palmitic acid residues can be attached to the ε- and α-amino groups of a
lysine residue and then linked, e.g., via one or more linking residues such as Gly, Gly-Gly, Ser, Ser-Ser, or the like,
to an immunogenic peptide. The lipidated peptide can then be administered either directly in a micelle or particle,
incorporated into a liposome, or emulsified in an adjuvant, e.g., incomplete Freund's adjuvant. In a preferred
embodiment, a particularly effective immunogenic composition comprises palmitic acid attached to ε- and α-amino
groups of Lys, which is attached via linkage, e.g., Ser-Ser, to the amino terminus of the immunogenic peptide.

As another example of lipid priming of CTL responses, E. coli lipoproteins, such as
tripalmitoyl-S-glycerylcysteinlyseryl-serine (P3CSS) can be used to prime virus specific CTL when covalently attached to an
appropriate peptide (see, e.g., Deres, et al., Nature 342:561, 1989). Peptides of the invention can be coupled to
P3CSS, for example, and the lipopeptide administered to an individual to specifically prime an immune response to
the target antigen. Moreover, because the induction of neutralizing antibodies can also be primed with
P3CSS-conjugated epitopes, two such compositions can be combined to more effectively elicit both humoral and
cell-mediated responses.

X.C.4. Vaccine Compositions Comprising DC Pulsed with CTL and/or HTL Peptides 

An embodiment of a vaccine composition in accordance with the invention comprises ex vivo
administration of a cocktail of epitope-bearing peptides to PBMC, or isolated DC therefrom, from the patient's
blood. A pharmaceutical to facilitate harvesting of DC can be used, such as Progenipoietin™
(Pharmacia-Monsanto, St. Louis, MO) or GM-CSF/IL-4. After pulsing the DC with peptides and prior to reinfusion into
patients, the DC are washed to remove unbound peptides. In this embodiment, a vaccine comprises peptide-pulsed
DCs which present the pulsed peptide epitopes complexed with HLA molecules on their surfaces.

The DC can be pulsed ex vivo with a cocktail of peptides, some of which stimulate CTL responses to
101P3A11. Optionally, a helper T cell (HTL) peptide, such as a natural or artificial loosely restricted HLA Class II
peptide, can be included to facilitate the CTL response. Thus, a vaccine in accordance with the invention is used to
treat a cancer which expresses or overexpresses 101P3A11.
X.D. Adoptive Immunotherapy 

Antigenic 101P3A11-related peptides are used to elicit a CTL and/or HTL response ex vivo, as well. The
resulting CTL or HTL cells can be used to treat tumors in patients that do not respond to other conventional forms
of therapy, or will not respond to a therapeutic vaccine peptide or nucleic acid in accordance with the invention. Ex
vivo CTL or HTL responses to a particular antigen are induced by incubating in tissue culture the patient's, or
genetically compatible, CTL or HTL precursor cells together with a source of antigen-presenting cells (APC), such
as dendritic cells, and the appropriate immunogenic peptide. After an appropriate incubation time (typically about
7-28 days), in which the precursor cells are activated and expanded into effector cells, the cells are infused back into
the patient, where they will destroy (CTL) or facilitate destruction (HTL) of their specific target cell (e.g., a tumor
cell). Transfected dendritic cells may also be used as antigen presenting cells.

X.E. Administration of Vaccines for Therapeutic or Prophylactic Purposes 

Pharmaceutical and vaccine compositions of the invention are typically used to treat and/or prevent a
cancer that expresses or overexpresses 101P3A11. In therapeutic applications, peptide and/or nucleic acid
compositions are administered to a patient in an amount sufficient to elicit an effective B cell, CTL and/or HTL
response to the antigen and to cure or at least partially arrest or slow symptoms and/or complications. An amount
adequate to accomplish this is defined as a "therapeutically effective dose." Amounts effective for this use will
depend on, e.g., the particular composition administered, the manner of administration, the stage and severity of the
disease being treated, the weight and general state of health of the patient, and the judgment of the prescribing
physician.

For pharmaceutical compositions, the immunogenic peptides of the invention, or DNA encoding them, are
generally administered to an individual already bearing a tumor that expresses 101P3A11. The peptides or DNA
encoding them can be administered individually or as fusions of one or more peptide sequences. Patients can be
treated with the immunogenic peptides separately or in conjunction with other treatments, such as surgery, as
appropriate.

For therapeutic use, administration should generally begin at the first diagnosis of 101P3A11-associated
cancer. This is followed by boosting doses until at least symptoms are substantially abated and for a period
thereafter. The embodiment of the vaccine composition (i.e., including, but not limited to embodiments such as
peptide cocktails, polyepitopic polypeptides, minigenes, or TAA-specific CTLs or pulsed dendritic cells) delivered
to the patient may vary according to the stage of the disease or the patient's health status. For example, in a patient
with a tumor that expresses 101P3A11, a vaccine comprising 101P3A11-specific CTL may be more efficacious in
killing tumor cells in a patient with advanced disease than alternative embodiments.

It is generally important to provide an amount of the peptide epitope delivered by a mode of administration
sufficient to effectively stimulate a cytotoxic T cell response; compositions which stimulate helper T cell responses
can also be given in accordance with this embodiment of the invention.

The dosage for an initial therapeutic immunization generally occurs in a unit dosage range where the lower
value is about 1, 5, 50, 500, or 1,000 µg and the higher value is about 10,000; 20,000; 30,000; or 50,000 µg.
Dosage values for a human typically range from about 500 µg to about 50,000 µg per 70 kilogram patient. Boosting
dosages of between about 1.0 µg and about 50,000 µg of peptide pursuant to a boosting regimen over weeks to
months may be administered depending upon the patient's response and condition as determined by measuring the
specific activity of CTL and HTL obtained from the patient's blood. Administration should continue until at least
clinical symptoms or laboratory tests indicate that the neoplasia has been eliminated or reduced and for a period
thereafter. The dosages, routes of administration, and dose schedules are adjusted in accordance with
methodologies known in the art.

In certain embodiments, the peptides and compositions of the present invention are employed in serious
disease states, that is, life-threatening or potentially life-threatening situations. In such cases, as a result of the
minimal amounts of extraneous substances and the relatively nontoxic nature of the peptides in preferred
compositions of the invention, it is possible and may be felt desirable by the treating physician to administer
substantial excesses of these peptide compositions relative to these stated dosage amounts.

The vaccine compositions of the invention can also be used purely as prophylactic agents. The
dosage for an initial prophylactic immunization generally occurs in a unit dosage range where the lower value is
about 1, 5, 50, 500, or 1,000 µg and the higher value is about 10,000; 20,000; 30,000; or 50,000 µg. Dosage values
for a human typically range from about 500 µg to about 50,000 µg per 70 kilogram patient. This is followed by
boosting dosages of between about 1.0 µg and about 50,000 µg of peptide administered at defined intervals from
about four weeks to six months after the initial administration of vaccine. The immunogenicity of the vaccine can
be assessed by measuring the specific activity of CTL and HTL obtained from a sample of the patient's blood.

The pharmaceutical compositions for therapeutic treatment are intended for parenteral, topical, oral, nasal,
intrathecal, or local (e.g., as a cream or topical ointment) administration. Preferably, the pharmaceutical
compositions are administered parenterally, e.g., intravenously, subcutaneously, intradermally, or intramuscularly.
Thus, the invention provides compositions for parenteral administration which comprise a solution of the
immunogenic peptides dissolved or suspended in an acceptable carrier, preferably an aqueous carrier.

A variety of aqueous carriers may be used, e.g., water, buffered water, 0.8% saline, 0.3% glycine,
hyaluronic acid and the like. These compositions may be sterilized by conventional, well-known sterilization
techniques, or may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or lyophilized,
the lyophilized preparation being combined with a sterile solution prior to administration.

The compositions may contain pharmaceutically acceptable auxiliary substances as required to
approximate physiological conditions, such as pH-adjusting and buffering agents, tonicity adjusting agents, wetting
agents, preservatives, and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium
chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc.

The concentration of peptides of the invention in the pharmaceutical formulations can vary widely, i.e.,
from less than about 0.1%, usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of administration
selected.

A human unit dose form of a composition is typically included in a pharmaceutical composition that
comprises a human unit dose of an acceptable carrier, in one embodiment an aqueous carrier, and is administered in
a volume/quantity that is known by those of skill in the art to be used for administration of such compositions to
humans (see, e.g., Remington's Pharmaceutical Sciences, 17th Edition, A. Gennaro, Editor, Mack Publishing Co.,
Easton, Pennsylvania, 1985). For example, a peptide dose for initial immunization can be from about 1 to about
50,000 µg, generally 100-5,000 µg, for a 70 kg patient. For example, for nucleic acids an initial immunization may
be performed using an expression vector in the form of naked nucleic acid administered IM (or SC or ID) in the
amounts of 0.5-5 mg at multiple sites. The nucleic acid (0.1 to 1000 µg) can also be administered using a gene gun
(i.e., ballistic delivery) in accordance with regimens known in the art. Following an incubation period of 3-4 weeks,
a booster dose is then administered. The booster can be recombinant fowlpox virus administered at a dose of 5 x 10^7
to 5 x 10^9 pfu.

For antibodies, a treatment generally involves repeated administration of the anti-101P3A11 antibody
preparation, via an acceptable route of administration such as intravenous injection (IV), typically at a dose in the
range of about 0.1 to about 10 mg/kg body weight. In general, doses in the range of 10-500 mg mAb per week are
effective and well tolerated. Moreover, an initial loading dose of approximately 4 mg/kg patient body weight IV,
followed by weekly doses of about 2 mg/kg IV of the anti-101P3A11 mAb preparation represents an acceptable
dosing regimen. As appreciated by those of skill in the art, various factors can influence the ideal dose in a
particular case. Such factors include, for example, half life of a composition, the binding affinity of an Ab, the
immunogenicity of a substance, the degree of 101P3A11 expression in the patient, the extent of circulating shed
101P3A11 antigen, the desired steady-state concentration level, frequency of treatment, and the influence of
chemotherapeutic or other agents used in combination with the treatment method of the invention, as well as the
health status of a particular patient. Non-limiting preferred human unit doses are, for example, 500 µg - 1 mg, 1 mg -
50 mg, 50 mg - 100 mg, 100 mg - 200 mg, 200 mg - 300 mg, 400 mg - 500 mg, 500 mg - 600 mg, 600 mg - 700 mg, or 1 mg
- 700 mg. In certain embodiments, the dose is in a range of 2-5 mg/kg body weight, e.g., with follow-on weekly
doses of 1-3 mg/kg; 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mg/kg body weight followed, e.g., in two, three or four weeks
by weekly doses; 0.5-10 mg/kg body weight, e.g., followed in two, three or four weeks by weekly doses; 225, 250,
275, 300, 325, 350, 375, or 400 mg/m^2 of body area weekly; 1-600 mg/m^2 of body area weekly; 225-400 mg/m^2 of body
area weekly; these doses can be followed by weekly doses for 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more weeks.
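
The weight-based regimen described above (a loading dose of approximately 4 mg/kg followed by weekly doses of about 2 mg/kg) reduces to simple arithmetic, which can be sketched as follows. This is an illustration only, not a dosing tool: the class name, function name and the 70 kg example patient are hypothetical, and the many dose-modifying factors listed above are not modeled.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class WeightBasedRegimen:
    """Loading dose followed by weekly doses, both expressed in mg/kg."""
    loading_mg_per_kg: float = 4.0  # approximate loading dose quoted in the text
    weekly_mg_per_kg: float = 2.0   # approximate weekly dose quoted in the text


def dose_schedule_mg(weight_kg: float, weeks: int,
                     regimen: WeightBasedRegimen = WeightBasedRegimen()) -> list[float]:
    """Return absolute doses in mg: the loading dose, then one dose per week."""
    if weight_kg <= 0 or weeks < 0:
        raise ValueError("weight must be positive and weeks non-negative")
    loading = regimen.loading_mg_per_kg * weight_kg
    weekly = regimen.weekly_mg_per_kg * weight_kg
    return [loading] + [weekly] * weeks


if __name__ == "__main__":
    # Hypothetical 70 kg patient with three weekly follow-on doses.
    print(dose_schedule_mg(70.0, 3))
```

For the hypothetical 70 kg patient this gives a 280 mg loading dose and 140 mg weekly doses, the latter falling within the 10-500 mg per week range stated above.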

In one embodiment, human unit dose forms of polynucleotides comprise a suitable dosage range or
effective amount that provides any therapeutic effect. As appreciated by one of ordinary skill in the art, a
therapeutic effect depends on a number of factors, including the sequence of the polynucleotide, molecular weight
of the polynucleotide and route of administration. Dosages are generally selected by the physician or other health
care professional in accordance with a variety of parameters known in the art, such as severity of symptoms, history
of the patient and the like. Generally, for a polynucleotide of about 20 bases, a dosage range may be selected from,
for example, an independently selected lower limit such as about 0.1, 0.25, 0.5, 1, 2, 5, 10, 20, 30, 40, 50, 60, 80,
100, 200, 300, 400 or 500 mg/kg up to an independently selected upper limit, greater than the lower limit, of about
60, 80, 100, 200, 300, 400, 500, 750, 1000, 1500, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10,000
mg/kg. For example, a dose may be about any of the following: 0.1 to 100 mg/kg, 0.1 to 50 mg/kg, 0.1 to 25
mg/kg, 0.1 to 10 mg/kg, 1 to 500 mg/kg, 100 to 400 mg/kg, 200 to 300 mg/kg, 1 to 100 mg/kg, 100 to 200 mg/kg,
300 to 400 mg/kg, 400 to 500 mg/kg, 500 to 1000 mg/kg, 500 to 5000 mg/kg, or 500 to 10,000 mg/kg. Generally,
parenteral routes of administration may require higher doses of polynucleotide compared to more direct application
of the nucleotide to diseased tissue, as do polynucleotides of increasing length.

In one embodiment, human unit dose forms of T-cells comprise a suitable dosage range or effective
amount that provides any therapeutic effect. As appreciated by one of ordinary skill in the art, a therapeutic effect
depends on a number of factors. Dosages are generally selected by the physician or other health care professional
in accordance with a variety of parameters known in the art, such as severity of symptoms, history of the patient and
the like. A dose may be about 10^4 cells to about 10^6 cells, about 10^6 cells to about 10^8 cells, about 10^8 to about 10^11
cells, or about 10^8 to about 5 x 10^10 cells. A dose may also be about 10^6 cells/m^2 to about 10^10 cells/m^2, or about 10^6
cells/m^2 to about 10^8 cells/m^2.
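
By way of illustration, a per-area dose range such as 10^6 to 10^10 cells/m^2 can be converted to a total cell number once a body surface area is known. The sketch below assumes the Mosteller approximation for body surface area, which is not part of this disclosure and is used here only as one commonly cited estimate; the function names and the example height and weight are hypothetical.

```python
import math


def mosteller_bsa_m2(height_cm: float, weight_kg: float) -> float:
    """Estimate body surface area in m^2 (Mosteller approximation, an assumption)."""
    if height_cm <= 0 or weight_kg <= 0:
        raise ValueError("height and weight must be positive")
    return math.sqrt(height_cm * weight_kg / 3600.0)


def total_cells_range(bsa_m2: float,
                      low_per_m2: float = 1e6,
                      high_per_m2: float = 1e10) -> tuple[float, float]:
    """Scale a cells/m^2 range (defaults taken from the text) to total cells."""
    return (low_per_m2 * bsa_m2, high_per_m2 * bsa_m2)


if __name__ == "__main__":
    bsa = mosteller_bsa_m2(180.0, 80.0)  # hypothetical patient
    low, high = total_cells_range(bsa)
    print(f"BSA {bsa:.2f} m^2: {low:.1e} to {high:.1e} cells")
```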

Protein(s) of the invention, and/or nucleic acids encoding the protein(s), can also be administered via
liposomes, which may also serve to: 1) target the protein(s) to a particular tissue, such as lymphoid tissue; 2)
target selectively to diseased cells; or 3) increase the half-life of the peptide composition. Liposomes include
emulsions, foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the
like. In these preparations, the peptide to be delivered is incorporated as part of a liposome, alone or in conjunction
with a molecule which binds to a receptor prevalent among lymphoid cells, such as monoclonal antibodies which
bind to the CD45 antigen, or with other therapeutic or immunogenic compositions. Thus, liposomes either filled or
decorated with a desired peptide of the invention can be directed to the site of lymphoid cells, where the liposomes
then deliver the peptide compositions. Liposomes for use in accordance with the invention are formed from
standard vesicle-forming lipids, which generally include neutral and negatively charged phospholipids and a sterol,
such as cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome size, acid lability
and stability of the liposomes in the blood stream. A variety of methods are available for preparing liposomes, as
described in, e.g., Szoka et al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), and U.S. Patent Nos. 4,235,871,
4,501,728, 4,837,028, and 5,019,369.

For targeting cells of the immune system, a ligand to be incorporated into the liposome can include, e.g.,
antibodies or fragments thereof specific for cell surface determinants of the desired immune system cells. A
liposome suspension containing a peptide may be administered intravenously, locally, topically, etc. in a dose which
varies according to, inter alia, the manner of administration, the peptide being delivered, and the stage of the
disease being treated.

For solid compositions, conventional nontoxic solid carriers may be used which include, for example,
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharin, talcum, cellulose,
glucose, sucrose, magnesium carbonate, and the like. For oral administration, a pharmaceutically acceptable
nontoxic composition is formed by incorporating any of the normally employed excipients, such as those carriers
previously listed, and generally 10-95% of active ingredient, that is, one or more peptides of the invention, and
more preferably at a concentration of 25%-75%.

For aerosol administration, immunogenic peptides are preferably supplied in finely divided form along
with a surfactant and propellant. Typical percentages of peptides are about 0.01%-20% by weight, preferably about
1%-10%. The surfactant must, of course, be nontoxic, and preferably soluble in the propellant. Representative of
such agents are the esters or partial esters of fatty acids containing from about 6 to 22 carbon atoms, such as
caproic, octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an aliphatic polyhydric
alcohol or its cyclic anhydride. Mixed esters, such as mixed or natural glycerides may be employed. The surfactant
may constitute about 0.1%-20% by weight of the composition, preferably about 0.25%-5%. The balance of the
composition is ordinarily propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal
delivery.

XI.) Diagnostic and Prognostic Embodiments of 101P3A11.

As disclosed herein, 101P3A11 polynucleotides, polypeptides, reactive cytotoxic T cells (CTL), reactive
helper T cells (HTL) and anti-polypeptide antibodies are used in well known diagnostic, prognostic and therapeutic
assays that examine conditions associated with dysregulated cell growth such as cancer, in particular the cancers
listed in Table I (see, e.g., both its specific pattern of tissue expression as well as its overexpression in certain
cancers as described, e.g., in the Example entitled "Expression analysis of 101P3A11 in normal tissues and patient
specimens").

101P3A11 can be analogized to a prostate associated antigen PSA, the archetypal marker that has been
used by medical practitioners for years to identify and monitor the presence of prostate cancer (see, e.g., Merrill et
al., J. Urol. 163(2): 503-5120 (2000); Polascik et al., J. Urol. Aug; 162(2):293-306 (1999) and Fortier et al., J. Nat.
Cancer Inst. 91(19): 1635-1640 (1999)). A variety of other diagnostic markers are also used in similar contexts
including p53 and K-ras (see, e.g., Tulchinsky et al., Int J Mol Med 1999 Jul 4(1):99-102 and Minimoto et al.,
Cancer Detect Prev 2000;24(1):1-12). Therefore, this disclosure of the 101P3A11 polynucleotides and polypeptides
(as well as the 101P3A11 polynucleotide probes and anti-101P3A11 antibodies used to identify the presence of
these molecules) and their properties allows skilled artisans to utilize these molecules in methods that are analogous
to those used, for example, in a variety of diagnostic assays directed to examining conditions associated with
cancer.

Typical embodiments of diagnostic methods which utilize the 101P3A11 polynucleotides, polypeptides,
reactive T cells and antibodies are analogous to those methods from well-established diagnostic assays which
employ, e.g., PSA polynucleotides, polypeptides, reactive T cells and antibodies. For example, just as PSA
polynucleotides are used as probes (for example in Northern analysis, see, e.g., Sharief et al., Biochem. Mol. Biol.
Int. 33(3):567-74 (1994)) and primers (for example in PCR analysis, see, e.g., Okegawa et al., J. Urol. 163(4):1189-
1190 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of monitoring PSA
overexpression or the metastasis of prostate cancers, the 101P3A11 polynucleotides described herein can be utilized
in the same way to detect 101P3A11 overexpression or the metastasis of prostate and other cancers expressing this
gene. Alternatively, just as PSA polypeptides are used to generate antibodies specific for PSA which can then be
used to observe the presence and/or the level of PSA proteins in methods to monitor PSA protein overexpression
(see, e.g., Stephan et al., Urology 55(4):560-3 (2000)) or the metastasis of prostate cells (see, e.g., Alanen et al.,
Pathol. Res. Pract. 192(3):233-7 (1996)), the 101P3A11 polypeptides described herein can be utilized to generate
antibodies for use in detecting 101P3A11 overexpression or the metastasis of prostate cells and cells of other
cancers expressing this gene.

Specifically, because metastasis involves the movement of cancer cells from an organ of origin (such as
the lung or prostate gland etc.) to a different area of the body (such as a lymph node), assays which examine a
biological sample for the presence of cells expressing 101P3A11 polynucleotides and/or polypeptides can be used
to provide evidence of metastasis. For example, when a biological sample from tissue that does not normally
contain 101P3A11-expressing cells (lymph node) is found to contain 101P3A11-expressing cells such as the
101P3A11 expression seen in LAPC4 and LAPC9, xenografts isolated from lymph node and bone metastasis,
respectively, this finding is indicative of metastasis.

Alternatively, 101P3A11 polynucleotides and/or polypeptides can be used to provide evidence of cancer,
for example, when cells in a biological sample that do not normally express 101P3A11 or express 101P3A11 at a
different level are found to express 101P3A11 or have an increased expression of 101P3A11 (see, e.g., the
101P3A11 expression in the cancers listed in Table I and in patient samples etc. shown in the accompanying
Figures). In such assays, artisans may further wish to generate supplementary evidence of metastasis by testing the
biological sample for the presence of a second tissue restricted marker (in addition to 101P3A11) such as PSA,
PSCA etc. (see, e.g., Alanen et al., Pathol. Res. Pract. 192(3): 233-237 (1996)).

Just as PSA polynucleotide fragments and polynucleotide variants are employed by skilled artisans for use
in methods of monitoring PSA, 101P3A11 polynucleotide fragments and polynucleotide variants are used in an
analogous manner. In particular, typical PSA polynucleotides used in methods of monitoring PSA are probes or
primers which consist of fragments of the PSA cDNA sequence. Illustrating this, primers used to PCR amplify a
PSA polynucleotide must include less than the whole PSA sequence to function in the polymerase chain reaction.
In the context of such PCR reactions, skilled artisans generally create a variety of different polynucleotide
fragments that can be used as primers in order to amplify different portions of a polynucleotide of interest or to
optimize amplification reactions (see, e.g., Caetano-Anolles, G. Biotechniques 25(3): 472-476, 478-480 (1998);
Robertson et al., Methods Mol. Biol. 98:121-154 (1998)). An additional illustration of the use of such fragments is
provided in the Example entitled "Expression analysis of 101P3A11 in normal tissues and patient specimens",
where a 101P3A11 polynucleotide fragment is used as a probe to show the expression of 101P3A11 RNAs in
cancer cells. In addition, variant polynucleotide sequences are typically used as primers and probes for the
corresponding mRNAs in PCR and Northern analyses (see, e.g., Sawai et al., Fetal Diagn. Ther. 1996 Nov-Dec
11(6):407-13 and Current Protocols In Molecular Biology, Volume 2, Unit 2, Frederick M. Ausubel et al. eds.,
1995). Polynucleotide fragments and variants are useful in this context where they are capable of binding to a
target polynucleotide sequence (e.g., the 101P3A11 polynucleotide shown in SEQ ID NO: 701) under conditions of
high stringency.
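
The practice of creating a variety of fragments from one cDNA for use as candidate primers or probes can be sketched as a sliding window over the sequence. The example below is a minimal illustration under stated assumptions: the function names are hypothetical, the input is a made-up toy sequence rather than SEQ ID NO: 701, and the GC-content window is an assumed screening heuristic that does not model hybridization stringency or any other primer design criterion.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def candidate_fragments(cdna: str, length: int,
                        min_gc: float = 0.4, max_gc: float = 0.6):
    """List (start, fragment) pairs of the given length whose GC fraction
    lies within [min_gc, max_gc]; thresholds are illustrative assumptions."""
    cdna = cdna.upper()
    if length <= 0 or length > len(cdna):
        return []
    fragments = []
    for start in range(len(cdna) - length + 1):
        frag = cdna[start:start + length]
        if min_gc <= gc_fraction(frag) <= max_gc:
            fragments.append((start, frag))
    return fragments


if __name__ == "__main__":
    toy_cdna = "ATGCATGCAAAATTTT"  # hypothetical sequence for illustration
    for start, frag in candidate_fragments(toy_cdna, 8):
        print(start, frag)
```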

Furthermore, PSA polypeptides which contain an epitope that can be recognized by an antibody or T cell
that specifically binds to that epitope are used in methods of monitoring PSA. 101P3A11 polypeptide fragments
and polypeptide analogs or variants can also be used in an analogous manner. This practice of using polypeptide
fragments or polypeptide variants to generate antibodies (such as anti-PSA antibodies or T cells) is typical in the art
with a wide variety of systems such as fusion proteins being used by practitioners (see, e.g., Current Protocols In
Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubel et al. eds., 1995). In this context, each epitope
functions to provide the architecture with which an antibody or T cell is reactive. Typically, skilled artisans create a
variety of different polypeptide fragments that can be used in order to generate immune responses specific for
different portions of a polypeptide of interest (see, e.g., U.S. Patent No. 5,840,501 and U.S. Patent No. 5,939,533).
For example, it may be preferable to utilize a polypeptide comprising one of the 101P3A11 biological motifs
discussed herein or a motif-bearing subsequence which is readily identified by one of skill in the art based on motifs
available in the art. Polypeptide fragments, variants or analogs are typically useful in this context as long as they
comprise an epitope capable of generating an antibody or T cell specific for a target polypeptide sequence (e.g., the
101P3A11 polypeptide shown in SEQ ID NO: 703).

As shown herein, the 101P3A11 polynucleotides and polypeptides (as well as the 101P3A11
polynucleotide probes and anti-101P3A11 antibodies or T cells used to identify the presence of these molecules)
exhibit specific properties that make them useful in diagnosing cancers such as those listed in Table I. Diagnostic
assays that measure the presence of 101P3A11 gene products, in order to evaluate the presence or onset of a disease
condition described herein, such as prostate cancer, are used to identify patients for preventive measures or further
monitoring, as has been done so successfully with PSA. Moreover, these materials satisfy a need in the art for
molecules having similar or complementary characteristics to PSA in situations where, for example, a definite
diagnosis of metastasis of prostatic origin cannot be made on the basis of a test for PSA alone (see, e.g., Alanen et
al., Pathol. Res. Pract. 192(3): 233-237 (1996)), and consequently, materials such as 101P3A11 polynucleotides and
polypeptides (as well as the 101P3A11 polynucleotide probes and anti-101P3A11 antibodies used to identify the
presence of these molecules) must be employed to confirm metastases of prostatic origin.

Finally, in addition to their use in diagnostic assays, the 101P3A11 polynucleotides disclosed herein have a
number of other utilities such as their use in the identification of oncogenetic associated chromosomal abnormalities
in the chromosomal region to which the 101P3A11 gene maps (see Table XXV). Moreover, in addition to their use
in diagnostic assays, the 101P3A11-related proteins and polynucleotides disclosed herein have other utilities such as
their use in the forensic analysis of tissues of unknown origin (see, e.g., Takahama K Forensic Sci Int 1996 Jun
28;80(1-2): 63-9).

Additionally, 101P3A11-related proteins or polynucleotides of the invention can be used to treat a
pathologic condition characterized by the over-expression of 101P3A11. For example, the amino acid or nucleic
acid sequence of Figure 2 or Figure 3, or fragments of either, can be used to generate an immune response to the
101P3A11 antigen. Antibodies or other molecules that react with 101P3A11 can be used to modulate the function
of this molecule, and thereby provide a therapeutic benefit.

XII.) Inhibition of 101P3A11 Protein Function

The invention includes various methods and compositions for inhibiting the binding of 101P3A11 to its
binding partner or its association with other protein(s) as well as methods for inhibiting 101P3A11 function.

XII.A.) Inhibition of 101P3A11 With Intracellular Antibodies

In one approach, a recombinant vector that encodes single chain antibodies that specifically bind to
101P3A11 is introduced into 101P3A11 expressing cells via gene transfer technologies. Accordingly, the encoded
single chain anti-101P3A11 antibody is expressed intracellularly, binds to 101P3A11 protein, and thereby inhibits
its function. Methods for engineering such intracellular single chain antibodies are well known. Such intracellular
antibodies, also known as "intrabodies", are specifically targeted to a particular compartment within the cell,
providing control over where the inhibitory activity of the treatment is focused. This technology has been
successfully applied in the art (for review, see Richardson and Marasco, 1995, TIBTECH vol. 13). Intrabodies have
been shown to virtually eliminate the expression of otherwise abundant cell surface receptors (see, e.g., Richardson
et al., 1995, Proc. Natl. Acad. Sci. USA 92: 3137-3141; Beerli et al., 1994, J. Biol. Chem. 269: 23931-23936;
Deshane et al., 1994, Gene Ther. 1: 332-337).

Single chain antibodies comprise the variable domains of the heavy and light chain joined by a flexible
linker polypeptide, and are expressed as a single polypeptide. Optionally, single chain antibodies are expressed as a
single chain variable region fragment joined to the light chain constant region. Well-known intracellular trafficking
signals are engineered into recombinant polynucleotide vectors encoding such single chain antibodies in order to
precisely target the intrabody to the desired intracellular compartment. For example, intrabodies targeted to the
endoplasmic reticulum (ER) are engineered to incorporate a leader peptide and, optionally, a C-terminal ER
retention signal, such as the KDEL amino acid motif. Intrabodies intended to exert activity in the nucleus are
engineered to include a nuclear localization signal. Lipid moieties are joined to intrabodies in order to tether the
intrabody to the cytosolic side of the plasma membrane. Intrabodies can also be targeted to exert function in the
cytosol. For example, cytosolic intrabodies are used to sequester factors within the cytosol, thereby preventing
them from being transported to their natural cellular destination.

In one embodiment, intrabodies are used to capture 101P3A11 in the nucleus, thereby preventing its
activity within the nucleus. Nuclear targeting signals are engineered into such 101P3A11 intrabodies in order to
achieve the desired targeting. Such 101P3A11 intrabodies are designed to bind specifically to a particular
101P3A11 domain. In another embodiment, cytosolic intrabodies that specifically bind to the 101P3A11 protein
are used to prevent 101P3A11 from gaining access to the nucleus, thereby preventing it from exerting any
biological activity within the nucleus (e.g., preventing 101P3A11 from forming transcription complexes with other
factors).

In order to specifically direct the expression of such intrabodies to particular cells, the transcription of the
intrabody is placed under the regulatory control of an appropriate tumor-specific promoter and/or enhancer. In
order to target intrabody expression specifically to prostate, for example, the PSA promoter and/or
promoter/enhancer can be utilized (see, for example, U.S. Patent No. 5,919,652 issued 6 July 1999).

XII.B.) Inhibition of 101P3A11 with Recombinant Proteins

In another approach, recombinant molecules bind to 101P3A11 and thereby inhibit 101P3A11 function.
For example, these recombinant molecules prevent or inhibit 101P3A11 from accessing/binding to its binding
partner(s) or associating with other protein(s). Such recombinant molecules can, for example, contain the reactive
part(s) of a 101P3A11 specific antibody molecule. In a particular embodiment, the 101P3A11 binding domain of a
101P3A11 binding partner is engineered into a dimeric fusion protein, whereby the fusion protein comprises two
101P3A11 ligand binding domains linked to the Fc portion of a human IgG, such as human IgG1. Such IgG portion can
contain, for example, the CH2 and CH3 domains and the hinge region, but not the CH1 domain. Such dimeric fusion
proteins are administered in soluble form to patients suffering from a cancer associated with the expression of
101P3A11, whereby the dimeric fusion protein specifically binds to 101P3A11 and blocks 101P3A11 interaction with a
binding partner. Such dimeric fusion proteins are further combined into multimeric proteins using known antibody
linking technologies.

XII.C.) Inhibition of 101P3A11 Transcription or Translation

The present invention also comprises various methods and compositions for inhibiting the transcription of
the 101P3A11 gene. Similarly, the invention also provides methods and compositions for inhibiting the translation
of 101P3A11 mRNA into protein.

In one approach, a method of inhibiting the transcription of the 101P3A11 gene comprises contacting the
101P3A11 gene with a 101P3A11 antisense polynucleotide. In another approach, a method of inhibiting 101P3A11
mRNA translation comprises contacting the 101P3A11 mRNA with an antisense polynucleotide. In another
approach, a 101P3A11 specific ribozyme is used to cleave the 101P3A11 message, thereby inhibiting translation.
Such antisense and ribozyme based methods can also be directed to the regulatory regions of the 101P3A11 gene,
such as the 101P3A11 promoter and/or enhancer elements. Similarly, proteins capable of inhibiting a 101P3A11
gene transcription factor are used to inhibit 101P3A11 mRNA transcription. The various polynucleotides and
compositions useful in the aforementioned methods have been described above. The use of antisense and ribozyme
molecules to inhibit transcription and translation is well known in the art.
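
As a minimal illustration of the antisense principle described above, an antisense oligonucleotide is the reverse complement of the mRNA segment it is intended to contact. The sketch below derives a DNA antisense oligonucleotide from an RNA segment; the function name is hypothetical and the input is a made-up toy segment, not a 101P3A11 sequence. Selection of an effective target site within a message is not addressed.

```python
# Map each RNA base to its complementary DNA base.
_RNA_TO_COMPLEMENTARY_DNA = str.maketrans("ACGU", "TGCA")


def antisense_dna_oligo(mrna_segment: str) -> str:
    """Return the DNA reverse complement (5'->3') of an mRNA segment (5'->3')."""
    seq = mrna_segment.upper()
    unexpected = set(seq) - set("ACGU")
    if unexpected:
        raise ValueError(f"unexpected RNA characters: {sorted(unexpected)}")
    # Complement each base, then reverse so the result reads 5'->3'.
    return seq.translate(_RNA_TO_COMPLEMENTARY_DNA)[::-1]


if __name__ == "__main__":
    toy_segment = "AUGGCC"  # hypothetical mRNA segment for illustration
    print(antisense_dna_oligo(toy_segment))
```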
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Other factors that inhibit the transcription of 101P3A1 1 by interfering with 101P3A1 1 transcriptional 
activation are also useful to treat cancers expressing 101P3A1 1. Similarly, factors that interfere with 101P3A1 1 
processing are useful to treat cancers that express 101P3A1 1. Cancer treatment methods utilizing such factors are 
also within the scope of the invention. 
5 XILD.) General Considerations for Therapeutic Strategies 

Gene transfer and gene therapy technologies can be used to deliver therapeutic polynucleotide molecules to
tumor cells synthesizing 101P3A11 (i.e., antisense, ribozyme, polynucleotides encoding intrabodies and other
101P3A11 inhibitory molecules). A number of gene therapy approaches are known in the art. Recombinant vectors
encoding 101P3A11 antisense polynucleotides, ribozymes, factors capable of interfering with 101P3A11 transcription,
and so forth, can be delivered to target tumor cells using such gene therapy approaches.

The above therapeutic approaches can be combined with any one of a wide variety of surgical, chemotherapy 
or radiation therapy regimens. The therapeutic approaches of the invention can enable the use of reduced dosages of 
chemotherapy (or other therapies) and/or less frequent administration, an advantage for all patients and particularly for 
those that do not tolerate the toxicity of the chemotherapeutic agent well. 

The anti-tumor activity of a particular composition (e.g., antisense, ribozyme, intrabody), or a combination of
such compositions, can be evaluated using various in vitro and in vivo assay systems. In vitro assays that evaluate
therapeutic activity include cell growth assays, soft agar assays and other assays indicative of tumor promoting activity,
binding assays capable of determining the extent to which a therapeutic composition will inhibit the binding of
101P3A11 to a binding partner, etc.

In vivo, the effect of a 101P3A11 therapeutic composition can be evaluated in a suitable animal model. For
example, xenogenic prostate cancer models can be used, wherein human prostate cancer explants or passaged xenograft
tissues are introduced into immune compromised animals, such as nude or SCID mice (Klein et al., 1997, Nature
Medicine 3: 402-408). For example, PCT Patent Application WO98/16628 and U.S. Patent 6,107,540, Sawyers et al.,
published April 23, 1998, describe various xenograft models of human prostate cancer capable of recapitulating the
development of primary tumors, micrometastasis, and the formation of osteoblastic metastases characteristic of late
stage disease. Efficacy can be predicted using assays that measure inhibition of tumor formation, tumor regression
or metastasis, and the like.

In vivo assays that evaluate the promotion of apoptosis are useful in evaluating therapeutic compositions.
In one embodiment, xenografts from tumor bearing mice treated with the therapeutic composition can be examined
for the presence of apoptotic foci and compared to untreated control xenograft-bearing mice. The extent to which
apoptotic foci are found in the tumors of the treated mice provides an indication of the therapeutic efficacy of the
composition.

The therapeutic compositions used in the practice of the foregoing methods can be formulated into
pharmaceutical compositions comprising a carrier suitable for the desired delivery method. Suitable carriers include
any material that when combined with the therapeutic composition retains the anti-tumor function of the therapeutic
composition and is generally non-reactive with the patient's immune system. Examples include, but are not limited
to, any of a number of standard pharmaceutical carriers such as sterile phosphate buffered saline solutions,
bacteriostatic water, and the like (see, generally, Remington's Pharmaceutical Sciences 16th Edition, A. Osol, Ed.,
1980).

Therapeutic formulations can be solubilized and administered via any route capable of delivering the
therapeutic composition to the tumor site. Potentially effective routes of administration include, but are not limited
to, intravenous, parenteral, intraperitoneal, intramuscular, intratumor, intradermal, intraorgan, orthotopic, and the
like. A preferred formulation for intravenous injection comprises the therapeutic composition in a solution of
preserved bacteriostatic water, sterile unpreserved water, and/or diluted in polyvinylchloride or polyethylene bags
containing 0.9% sterile Sodium Chloride for Injection, USP. Therapeutic protein preparations can be lyophilized
and stored as sterile powders, preferably under vacuum, and then reconstituted in bacteriostatic water (containing,
for example, benzyl alcohol preservative) or in sterile water prior to injection.

Dosages and administration protocols for the treatment of cancers using the foregoing methods will vary with 
the method and the target cancer, and will generally depend on a number of other factors appreciated in the art. 

XIII.) Kits

For use in the diagnostic and therapeutic applications described herein, kits are also within the scope of the
invention. Such kits can comprise a carrier, package or container that is compartmentalized to receive one or more
containers such as vials, tubes, and the like, each of the container(s) comprising one of the separate elements to be
used in the method. For example, the container(s) can comprise a probe that is or can be detectably labeled. Such
probe can be an antibody or polynucleotide specific for a 101P3A11-related protein or a 101P3A11 gene or
message, respectively. Where the method utilizes nucleic acid hybridization to detect the target nucleic acid, the kit
can also have containers containing nucleotide(s) for amplification of the target nucleic acid sequence and/or a
container comprising a reporter-means, such as a biotin-binding protein, such as avidin or streptavidin, bound to a
reporter molecule, such as an enzymatic, fluorescent, or radioisotope label. The kit can include all or part of the
amino acid sequence of Figure 2 or Figure 3 or analogs thereof, or a nucleic acid molecule that encodes such
amino acid sequences or which hybridizes to such sequences.

The kit of the invention will typically comprise the container described above and one or more other containers
comprising materials desirable from a commercial and user standpoint, including buffers, diluents, filters, needles,
syringes, and package inserts with instructions for use. Any physical object in a kit of the invention, e.g., syringe or vial,
can have a label affixed thereto. A label or package insert can have instructions for use in human administration printed
thereon. Thus, a label can be present on the container to indicate that the composition is used for a specific therapy or
non-therapeutic application, and can also indicate directions for either in vivo or in vitro use, such as those described
above. Directions and/or other information can also be included on an insert that is included with the kit.

EXAMPLES

Various aspects of the invention are further described and illustrated by way of the several examples that 
follow, none of which are intended to limit the scope of the invention. 

Example 1: Expression analysis of 101P3A11 in normal tissues and patient specimens

Analysis of 101P3A11 by RT-PCR is shown in Figure 10A. First strand cDNA was prepared from vital
pool 1 (VP1: liver, lung and kidney), vital pool 2 (VP2: pancreas, colon and stomach), prostate xenograft pool,
prostate cancer pool, kidney cancer pool, colon cancer pool, breast cancer pool, and cancer metastasis pool.
Normalization was performed by PCR using primers to actin and GAPDH. Semi-quantitative PCR, using primers to
101P3A11, was performed at 30 cycles of amplification. Expression of 101P3A11 was observed in prostate
xenograft pool, prostate cancer pool, kidney cancer pool, colon cancer pool, breast cancer pool, and cancer
metastasis pool, but not in VP1 and VP2. Dot blots using patient-derived amplified cDNAs (Clontech, CA) show
upregulation of PHOR-1 in 3/3 prostate cancer patients, 6/14 kidney cancer patients, 2/8 uterine cancer, 1/1 cervical
cancer, 3/8 stomach cancer, and in 7/7 rectal cancer patients (Figure 10B). Expression of 101P3A11 was assayed in
a panel of human patient cancer specimens (Figure 11). RNA was extracted from a pool of three prostate cancer
tumors, kidney cancer tumors, colon cancer tumors, breast cancer tumors, and cancer metastasis pool derived from
cancer patients, as well as from normal prostate (NP), normal bladder (NB), normal kidney (NK) and normal colon
(NC). Northern blots with 10 µg of total RNA/lane were probed with a 101P3A11 sequence fragment. Size
standards in kilobases (kb) are indicated on the side. The results show expression of 101P3A11 in prostate cancer
tumors, kidney cancer tumors, colon cancer tumors, breast cancer tumors, cancer metastasis pool, bladder cancer
pool, and in the normal prostate but not in the other normal tissues.
Northern blot analysis on individual prostate patient tumor specimens is shown in Figure 12A. RNA was
extracted from prostate tumors (T) and their normal adjacent tissues (Nat) derived from prostate cancer patients.
Northern blots with 10 µg of total RNA/lane were probed with 101P3A11 sequence. Results showed expression of
101P3A11 in all three patient specimens, and expression is especially upregulated in one of the three prostate tumor
tissues.

RNA in situ analysis using anti-sense 101P3A11 riboprobe showed significant glandular epithelial and
basal cell expression in normal prostate (4/4), PIN (1/1), and prostate cancer (6/6) patients. 101P3A11 sense
riboprobe had little to no staining. The RNA in situ staining in PIN and prostate cancer is shown in Figure 12B and
12C. The staining intensity in the cancer cells was generally higher than that observed in normal glands (Figure
12D and 12E). The RNA in situ results also demonstrate that the expression observed in the prostate tissues is in
the glandular epithelia, basal cells, and cancer cells.

Endogenous expression of the 101P3A11 protein is demonstrated in the immunohistochemistry analysis of
the anti-101P3A11 (PEPTIDE 1: amino acids 1-14) rabbit polyclonal antibody (Figure 40A-40F). Staining in
prostate cancer is greater than the staining observed in normal prostate. The staining is localized apically within the
luminal epithelia of the normal prostate (Figures 40E and 40F). The staining observed in prostate cancer is also
localized apically in low to intermediate grade cancer (Figures 40B and 40C) and throughout all cells of more
advanced prostate cancer (Figure 40A). Staining was observed in 19/20 normal prostate patients and in all of the
nineteen prostate cancer patients analyzed. The prostate cancer cell line, LNCaP, also shows similar staining
(Figures 40D and 40F) in almost all cells.

In addition, the present protocol was used to identify endogenous expression of the 101P3A11 protein in
prostate cancer, bladder cancer, kidney cancer, colon cancer, lung cancer, and breast cancer. Immunohistochemical
analysis was performed with the anti-101P3A11 (PEPTIDE 1: amino acids 1-14) rabbit polyclonal antibody
(prostate cancer, Figure 41A; bladder cancer, Figure 41B; kidney cancer, Figure 41C; colon cancer, lung cancer,
Figure 41E; and breast cancer, Figure 41F). Specific staining is observed in tumor cells of the six cancers analyzed.
Expression of 101P3A11 was also detected in the tumors of two colon cancer patients but not in normal
colon tissues (Figure 13), and in five out of six kidney tumors isolated from kidney cancer patients (Figure 14). The
expression detected in normal adjacent tissues (isolated from diseased tissues) but not in normal tissues of the
kidney (isolated from healthy donors) indicates that these tissues are not fully normal and that 101P3A11 is
expressed in early stage tumors.

In order to assay for androgen regulation of 101P3A11 expression, LAPC-9 cells
were grown in charcoal-stripped medium and stimulated with the synthetic androgen mibolerone, for either 14 or 24
hours (Figure 15A, Figure 15B, and Figure 15C). Northern blots with 10 µg of total RNA/lane were probed with
the 101P3A11 sequences (Figure 15A). A picture of the ethidium-bromide staining of the RNA gel is also
presented (Figure 15C). Results showed expression of 101P3A11 is not regulated by androgen. The experimental
samples were confirmed by testing for the expression of the androgen-regulated prostate cancer gene PSA (Figure
15B). This experiment showed that, as expected, PSA levels go down in presence of charcoal-stripped serum, and
expression is induced at 14 and 24 hours in presence of mibolerone.
Analysis of androgen regulation of 101P3A11 in vivo is shown in Figure 16. Male mice were injected with
LAPC-9AD tumor cells. When tumors reached a palpable size, mice were castrated and tumors harvested at
different time points following castration. RNA was isolated from the xenograft tissues. Northern blots with 10 µg
of total RNA/lane were probed with 101P3A11 sequences. Size standards in kilobases (kb) are indicated on the
side. A picture of the ethidium-bromide staining of the RNA gel is also presented in Figure 16. The results showed
that expression of 101P3A11 was not affected by androgen deprivation, and therefore, is not androgen regulated.

Example 2: Production of Recombinant 101P3A11 in Prokaryotic and Yeast Systems

To express recombinant 101P3A11 in prokaryotic cells, the full or partial length 101P3A11 cDNA
sequences can be cloned into any one of a variety of expression vectors known in the art. One or more of the
following regions of 101P3A11 are expressed in these constructs: amino acids 1 to 317; or any 8, 9, 10, 11, 12, 13,
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more contiguous amino acids from 101P3A11,
variants, or analogs thereof.

A. In vitro transcription and translation constructs: 

pCRII: To generate 101P3A11 sense and anti-sense RNA probes for RNA in situ investigations, pCRII
constructs (Invitrogen, Carlsbad, CA) are generated encoding either all or fragments of the 101P3A11 cDNA. The
pCRII vector has Sp6 and T7 promoters flanking the insert to drive the transcription of 101P3A11 RNA for use as
probes in RNA in situ hybridization experiments. These probes are used to analyze the cell and tissue expression of
101P3A11 at the RNA level. Transcribed 101P3A11 RNA representing the cDNA amino acid coding region of the
101P3A11 gene is used in in vitro translation systems such as the TnT™ Coupled Reticulocyte Lysate System (Promega
Corp., Madison, WI) to synthesize 101P3A11 protein.

B. Bacterial Constructs: 

pGEX Constructs: To generate recombinant 101P3A11 proteins in bacteria that are fused to the
Glutathione S-transferase (GST) protein, all or parts of the 101P3A11 cDNA protein coding sequence are fused to
the GST gene by cloning into pGEX-6P-1 or any other GST-fusion vector of the pGEX family (Amersham
Pharmacia Biotech, Piscataway, NJ). These constructs allow controlled expression of recombinant 101P3A11
protein sequences with GST fused at the amino-terminus and a six histidine epitope (6X His) at the
carboxyl-terminus. The GST and 6X His tags permit purification of the recombinant fusion protein from induced bacteria
with the appropriate affinity matrix and allow recognition of the fusion protein with anti-GST and anti-His
antibodies. The 6X His tag is generated by adding 6 histidine codons to the cloning primer at the 3' end, e.g., of the
open reading frame (ORF). A proteolytic cleavage site, such as the PreScission™ recognition site in pGEX-6P-1,
can be employed that permits cleavage of the GST tag from 101P3A11-related protein. The ampicillin resistance
gene and pBR322 origin permit selection and maintenance of the pGEX plasmids in E. coli. In one embodiment,
amino acids 86-317 are cloned into the pGEX-2T expression vector, and the protein is expressed and purified.

pMAL Constructs: To generate, in bacteria, recombinant 101P3A11 proteins that are fused to
maltose-binding protein (MBP), all or parts of the 101P3A11 cDNA protein coding sequence are fused to the MBP gene by
cloning into the pMAL-c2X and pMAL-p2X vectors (New England Biolabs, Beverly, MA). These constructs allow
controlled expression of recombinant 101P3A11 protein sequences with MBP fused at the amino-terminus and a 6X
His epitope tag at the carboxyl-terminus. The MBP and 6X His tags permit purification of the recombinant protein
from induced bacteria with the appropriate affinity matrix and allow recognition of the fusion protein with
anti-MBP and anti-His antibodies. The 6X His epitope tag is generated by adding 6 histidine codons to the 3' cloning
primer. A Factor Xa recognition site permits cleavage of the pMAL tag from 101P3A11. The pMAL-c2X and
pMAL-p2X vectors are optimized to express the recombinant protein in the cytoplasm or periplasm, respectively.
Periplasm expression enhances folding of proteins with disulfide bonds. In one embodiment, amino acids 86-310
are cloned into the pMAL-c2X expression vector, and the protein is expressed and purified.

pET Constructs: To express 101P3A11 in bacterial cells, all or parts of the 101P3A11 cDNA protein
coding sequence are cloned into the pET family of vectors (Novagen, Madison, WI). These vectors allow tightly
controlled expression of recombinant 101P3A11 protein in bacteria with and without fusion to proteins that enhance
solubility, such as NusA and thioredoxin (Trx), and epitope tags, such as 6X His and S-Tag™, that aid purification
and detection of the recombinant protein. For example, constructs are made utilizing pET NusA fusion system 43.1
such that regions of the 101P3A11 protein are expressed as amino-terminal fusions to NusA.

C. Yeast Constructs:

pESC Constructs: To express 101P3A11 in the yeast species Saccharomyces cerevisiae for generation of
recombinant protein and functional studies, all or parts of the 101P3A11 cDNA protein coding sequence are cloned
into the pESC family of vectors, each of which contains 1 of 4 selectable markers, HIS3, TRP1, LEU2, and URA3
(Stratagene, La Jolla, CA). These vectors allow controlled expression from the same plasmid of up to 2 different
genes or cloned sequences containing either Flag™ or Myc epitope tags in the same yeast cell. This system is used
to confirm protein-protein interactions of 101P3A11. In addition, expression in yeast yields
post-translational modifications, such as glycosylations and phosphorylations, similar to those found when expressed in
eukaryotic cells.

pESP Constructs: To express 101P3A11 in the yeast species Schizosaccharomyces pombe, all or parts of the
101P3A11 cDNA protein coding sequence are cloned into the pESP family of vectors. These vectors allow
controlled high level expression of a 101P3A11 protein sequence that is fused at either the amino terminus or at the
carboxyl terminus to GST, which aids purification of the recombinant protein. A Flag™ epitope tag allows
detection of the recombinant protein with anti-Flag™ antibody.

Example 3: Production of Recombinant 101P3A11 in Higher Eukaryotic Systems

A. Mammalian Constructs: 

To express recombinant 101P3A11 in eukaryotic cells, full or partial length 101P3A11 cDNA sequences
can be cloned into any one of a variety of expression vectors known in the art. One or more of the following
regions of 101P3A11 are expressed in these constructs: amino acids 1 to 317; or any 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more contiguous amino acids from 101P3A11,
variants, or analogs thereof.

The constructs can be transfected into any one of a wide variety of mammalian cells such as 293T cells.
Transfected 293T cell lysates can be probed with the anti-101P3A11 polyclonal serum, described herein.

pcDNA4/HisMax Constructs: To express 101P3A11 in mammalian cells, the 101P3A11 ORF was cloned
into pcDNA4/HisMax Version A (Invitrogen, Carlsbad, CA). Protein expression is driven from the
cytomegalovirus (CMV) promoter and the SP16 translational enhancer. The recombinant protein has Xpress™ and
six histidine (6X His) epitopes fused to the amino-terminus. The pcDNA4/HisMax vector also contains the bovine
growth hormone (BGH) polyadenylation signal and transcription termination sequence to enhance mRNA stability
along with the SV40 origin for episomal replication and simple vector rescue in cell lines expressing the large T
antigen. The Zeocin resistance gene allows for selection of mammalian cells expressing the protein, and the
ampicillin resistance gene and ColE1 origin permit selection and maintenance of the plasmid in E. coli.

pcDNA3.1/MycHis Constructs: To express 101P3A11 in mammalian cells, the 101P3A11 ORF, with a
consensus Kozak translation initiation site, was cloned into pcDNA3.1/MycHis Version A (Invitrogen, Carlsbad,
CA). Protein expression is driven from the cytomegalovirus (CMV) promoter. The recombinant proteins have the
myc epitope and 6X His epitope fused to the carboxyl-terminus. The pcDNA3.1/MycHis vector also contains the
bovine growth hormone (BGH) polyadenylation signal and transcription termination sequence to enhance mRNA
stability, along with the SV40 origin for episomal replication and simple vector rescue in cell lines expressing the
large T antigen. The Neomycin resistance gene can be used, as it allows for selection of mammalian cells
expressing the protein, and the ampicillin resistance gene and ColE1 origin permit selection and maintenance of the
plasmid in E. coli.

pcDNA3.1/CT-GFP-TOPO Construct: To express 101P3A11 in mammalian cells and to allow
detection of the recombinant proteins using fluorescence, the 101P3A11 ORF, with a consensus Kozak translation
initiation site, was cloned into pcDNA3.1/CT-GFP-TOPO (Invitrogen, CA). Protein expression is driven from the
cytomegalovirus (CMV) promoter. The recombinant proteins have the Green Fluorescent Protein (GFP) fused to
the carboxyl-terminus, facilitating non-invasive, in vivo detection and cell biology studies. The
pcDNA3.1/CT-GFP-TOPO vector also contains the bovine growth hormone (BGH) polyadenylation signal and transcription termination
sequence to enhance mRNA stability along with the SV40 origin for episomal replication and simple vector rescue
in cell lines expressing the large T antigen. The Neomycin resistance gene allows for selection of mammalian cells
that express the protein, and the ampicillin resistance gene and ColE1 origin permit selection and maintenance of
the plasmid in E. coli. Additional constructs with an amino-terminal GFP fusion are made in
pcDNA3.1/NT-GFP-TOPO spanning the entire length of the 101P3A11 proteins.

pAPtag: The 101P3A11 ORF, or portions thereof, is cloned into pAPtag-5 (GenHunter
Corp., Nashville, TN). This construct generates an alkaline phosphatase fusion at the carboxyl-terminus of the
101P3A11 proteins while fusing the IgGκ signal sequence to the amino-terminus. Constructs are also generated in
which alkaline phosphatase with an amino-terminal IgGκ signal sequence is fused to the amino-terminus of
101P3A11 proteins. The resulting recombinant 101P3A11 proteins are optimized for secretion into the media of
transfected mammalian cells and can be used to identify proteins such as ligands or receptors that interact with the
101P3A11 proteins. Protein expression is driven from the CMV promoter and the recombinant proteins also
contain myc and 6X His epitopes fused at the carboxyl-terminus that facilitate detection and purification. The
Zeocin resistance gene present in the vector allows for selection of mammalian cells expressing the recombinant
protein and the ampicillin resistance gene permits selection of the plasmid in E. coli.

pTag5: The 101P3A11 ORF, or portions thereof, is cloned into pTag-5. This vector is
similar to pAPtag but without the alkaline phosphatase fusion. This construct generated 101P3A11 protein with an
amino-terminal IgGκ signal sequence and myc and 6X His epitope tags at the carboxyl-terminus that facilitate
detection and affinity purification. The resulting recombinant 101P3A11 protein was optimized for secretion into
the media of transfected mammalian cells, and was used as immunogen or ligand to identify proteins such as
ligands or receptors that interact with the 101P3A11 proteins. Protein expression is driven from the CMV
promoter. The Zeocin resistance gene present in the vector allows for selection of mammalian cells expressing the
protein, and the ampicillin resistance gene permits selection of the plasmid in E. coli.

psecFc: The 101P3A11 ORF, or portions thereof, is also cloned into psecFc. The psecFc
vector was assembled by cloning the human immunoglobulin G1 (IgG) Fc (hinge, CH2, CH3 regions) into
pSecTag2 (Invitrogen, California). This construct generates an IgG1 Fc fusion at the carboxyl-terminus of the
101P3A11 proteins, while fusing the IgGκ signal sequence to the N-terminus. 101P3A11 fusions utilizing the murine
IgG1 Fc region are also used. The resulting recombinant 101P3A11 proteins are optimized for secretion into the
media of transfected mammalian cells, and can be used as immunogens or to identify proteins such as ligands or
receptors that interact with the 101P3A11 protein. Protein expression is driven from the CMV promoter. The
hygromycin resistance gene present in the vector allows for selection of mammalian cells that express the
recombinant protein, and the ampicillin resistance gene permits selection of the plasmid in E. coli.

The amino acid region 159-202 of the 101P3A11 ORF was cloned into psecFc. The resulting recombinant
101P3A11(159-202)-psecFc construct was transfected into 293T and Cos-7 cells, and the expression of
recombinant 101P3A11(159-202)-psecFc protein assayed by Western blotting (Figure 17). Results show that
101P3A11(159-202)-psecFc fusion protein was expressed in the lysates of both 293T and Cos-7 cells. The
101P3A11(159-202)-psecFc fusion protein was also secreted and detected in the culture supernatants of both cell
types.

pSRα Constructs: To generate mammalian cell lines that express 101P3A11 constitutively, the ORF of
101P3A11 was cloned into pSRα constructs. Amphotropic and ecotropic retroviruses were generated by
transfection of pSRα constructs into the 293T-10A1 packaging line or co-transfection of pSRα and a helper plasmid
(containing deleted packaging sequences) into the 293 cells, respectively. The retrovirus was used to infect a
variety of mammalian cell lines, resulting in the integration of the cloned gene, 101P3A11, into the host cell lines.
Protein expression is driven from a long terminal repeat (LTR). The Neomycin resistance gene present in the vector
allows for selection of mammalian cells that express the protein, and the ampicillin resistance gene and ColE1
origin permit selection and maintenance of the plasmid in E. coli. Figure 18 shows that 101P3A11 was expressed
using the pSRα retroviral vector in the cell line 300.19. The retroviral vectors can thereafter be used for infection
and generation of various cell lines using, for example, PC3, NIH 3T3, TsuPr1, 293 or rat-1 cells.

Additional pSRα constructs are made that fuse an epitope tag such as the FLAG™ tag to the
carboxyl-terminus of 101P3A11 sequences to allow detection using anti-Flag antibodies. For example, the FLAG™
sequence 5' gat tac aag gat gac gac gat aag 3' is added to the cloning primer at the 3' end of the ORF. Additional pSRα
constructs are made to produce both amino-terminal and carboxyl-terminal GFP and myc/6X His fusion proteins of
the full-length 101P3A11 proteins.
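
The primer-based tagging described above can be illustrated with a minimal Python sketch. It translates the FLAG™ coding sequence quoted in the text and builds a reverse (3') cloning primer that carries a tag followed by a stop codon. The helper names, the choice of CAC as the histidine codon, the TAA stop codon, and the six-base placeholder standing in for the 3' end of the ORF are assumptions made for illustration only; they are not taken from the specification, and a real primer would also need restriction sites and an appropriate annealing length.

```python
# Illustrative sketch only: helper names and the placeholder ORF end are hypothetical.

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# FLAG coding sequence as quoted in the text; 6X His as six histidine codons
# (CAC is an assumed codon choice, CAT would also encode histidine).
FLAG_DNA = "gat tac aag gat gac gac gat aag".replace(" ", "").upper()
HIS6_DNA = "CAC" * 6


def translate(dna):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def tagged_reverse_primer(orf_3prime_end, tag_dna, stop_codon="TAA"):
    """Reverse primer that appends tag_dna and a stop codon after the ORF's last codons.

    orf_3prime_end is the sense-strand 3' end of the ORF without its own stop codon.
    """
    return reverse_complement(orf_3prime_end + tag_dna + stop_codon)


if __name__ == "__main__":
    print(translate(FLAG_DNA))   # peptide encoded by the FLAG sequence
    print(translate(HIS6_DNA))   # six histidines
    # "GCTGCA" is a placeholder, not the actual 3' end of the 101P3A11 ORF.
    print(tagged_reverse_primer("GCTGCA", FLAG_DNA))
```

Translating the quoted sequence yields the eight-residue peptide DYKDDDDK. Because the tag is placed on the sense strand before the stop codon and the whole stretch is then reverse-complemented, the amplified product encodes the tag in frame at the carboxyl-terminus.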

Additional Viral Vectors: Additional constructs are made for viral-mediated delivery and expression of
101P3A11. High virus titer leading to high level expression of 101P3A11 is achieved in viral delivery systems such
as adenoviral vectors and herpes amplicon vectors. The 101P3A11 coding sequences or fragments thereof are
amplified by PCR and subcloned into the AdEasy shuttle vector (Stratagene). Recombination and virus packaging
are performed according to the manufacturer's instructions to generate adenoviral vectors. Alternatively,
101P3A11 coding sequences or fragments thereof are cloned into the HSV-1 vector (Imgenex) to generate herpes
viral vectors. The viral vectors are thereafter used for infection of various cell lines such as SCaBER, NIH 3T3,
293 or rat-1 cells.

Regulated Expression Systems: To control expression of 101P3A11 in mammalian cells, coding
sequences of 101P3A11, or portions thereof, are cloned into regulated mammalian expression systems such as the
T-Rex System (Invitrogen), the GeneSwitch System (Invitrogen) and the tightly-regulated Ecdysone System
(Stratagene). These systems allow the study of the temporal and concentration dependent effects of recombinant
101P3A11. These vectors are thereafter used to control expression of 101P3A11 in various cell lines such as
SCaBER, NIH 3T3, 293 or rat-1 cells.

B. Baculovirus Expression Systems 

To generate recombinant 101P3A11 proteins in a baculovirus expression system, 101P3A11 ORF, or
portions thereof, are cloned into the baculovirus transfer vector pBlueBac 4.5 (Invitrogen), which provides a His-tag
at the N-terminus. Specifically, pBlueBac-101P3A11 is co-transfected with helper plasmid pBac-N-Blue
(Invitrogen) into SF9 (Spodoptera frugiperda) insect cells to generate recombinant baculovirus (see Invitrogen
instruction manual for details). Baculovirus is then collected from cell supernatant and purified by plaque assay.

Recombinant 101P3A11 protein is then generated by infection of HighFive insect cells (Invitrogen) with
purified baculovirus. Recombinant 101P3A11 protein can be detected using anti-101P3A11 or anti-His-tag
antibody. 101P3A11 protein can be purified and used in various cell-based assays or as immunogen to generate
polyclonal and monoclonal antibodies specific for 101P3A11.


Example 4: Antigenicity Profiles and Secondary Structure

Figure 5, Figure 6, Figure 7, Figure 8, and Figure 9 depict graphically five amino acid profiles of the
101P3A11 amino acid sequence, each assessment available by accessing the ProtScale website (URL
www.expasy.ch/cgi-bin/protscale.pl) on the ExPasy molecular biology server.

These profiles: Figure 5, Hydrophilicity (Hopp T.P., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A.
78:3824-3828); Figure 6, Hydropathicity (Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132); Figure 7,
Percentage Accessible Residues (Janin J., 1979. Nature 277:491-492); Figure 8, Average Flexibility (Bhaskaran R.,
and Ponnuswamy P.K., 1988. Int. J. Pept. Protein Res. 32:242-255); Figure 9, Beta-turn (Deleage, G., Roux B., 1987.
Protein Engineering 1:289-294); and optionally others available in the art, such as on the ProtScale website, were
used to identify antigenic regions of the 101P3A11 protein. Each of the above amino acid profiles of 101P3A11
was generated using the following ProtScale parameters for analysis: 1) a window size of 9; 2) 100% weight of
the window edges compared to the window center; and 3) amino acid profile values normalized to lie between 0
and 1.
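
The three ProtScale parameters listed above can be illustrated with a minimal sketch. It assumes that normalization is a min-max rescaling of the per-residue scale values before averaging, and that "100% weight of the window edges" means every residue in the window is weighted equally; both are assumptions about how the server behaves rather than statements from this document. The function names are hypothetical. The scale shown is the published Kyte-Doolittle hydropathy scale used for Figure 6.

```python
# Kyte-Doolittle hydropathy values (Kyte and Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def normalize_scale(scale):
    """Min-max rescale the per-residue values to the range 0..1 (assumed normalization)."""
    lo, hi = min(scale.values()), max(scale.values())
    return {aa: (value - lo) / (hi - lo) for aa, value in scale.items()}


def window_profile(sequence, scale, window=9):
    """Return (center position, score) pairs; positions are 1-based.

    Every residue in the window carries equal weight, which is how this
    sketch interprets "100% weight of the window edges".
    A residue missing from the scale raises KeyError.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    normalized = normalize_scale(scale)
    half = window // 2
    profile = []
    for start in range(len(sequence) - window + 1):
        segment = sequence[start:start + window]
        score = sum(normalized[aa] for aa in segment) / window
        profile.append((start + half + 1, score))
    return profile
```

With a window of 9, the first scored position is residue 5 and the last is residue N-4 of an N-residue sequence, so the four residues at each terminus receive no score in this sketch.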

Hydrophilicity (Figure 5), Hydropathicity (Figure 6) and Percentage Accessible Residues (Figure 7)
profiles were used to determine stretches of hydrophilic amino acids (i.e., values greater than 0.5 on the
Hydrophilicity and Percentage Accessible Residues profiles, and values less than 0.5 on the Hydropathicity profile).
Such regions are likely to be exposed to the aqueous environment, be present on the surface of the protein, and thus
be available for immune recognition, such as by antibodies.

Average Flexibility (Figure 8) and Beta-turn (Figure 9) profiles determine stretches of amino acids (i.e., 
values greater than 0.5 on the Beta-turn profile and the Average Flexibility profile) that are not constrained in
secondary structures such as beta sheets and alpha helices. Such regions are also more likely to be exposed portions 
of the protein and thus are accessible to immune recognition, such as by antibodies. 
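
The threshold rule described above (values greater than 0.5, or less than 0.5 for Hydropathicity) can be sketched as a search for runs of consecutive positions that satisfy the cutoff. The function name and the `min_length` parameter are hypothetical and are introduced only for illustration.

```python
def stretches(profile, threshold=0.5, above=True, min_length=1):
    """Find runs of consecutive positions whose score passes the threshold.

    profile: iterable of (position, score) pairs with integer positions.
    above=True keeps scores > threshold (e.g. Hydrophilicity);
    above=False keeps scores < threshold (e.g. Hydropathicity).
    Returns a list of (start, end) positions, inclusive.
    """
    runs = []
    start = prev = None
    for position, score in profile:
        hit = score > threshold if above else score < threshold
        if hit and start is not None and position == prev + 1:
            prev = position  # extend the current run
            continue
        # Close the current run, if any, before starting a new one.
        if start is not None and prev - start + 1 >= min_length:
            runs.append((start, prev))
        start = prev = position if hit else None
    if start is not None and prev - start + 1 >= min_length:
        runs.append((start, prev))
    return runs
```

Setting `min_length=5` would restrict the output to runs at least as long as the shortest peptide immunogen contemplated in this Example.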

Antigenic sequences of the 101P3A11 protein indicated, e.g., by the profiles set forth in Figure 5, Figure 6,
Figure 7, Figure 8, and/or Figure 9 are used to prepare immunogens, either peptides or nucleic acids that encode
them, to generate anti-101P3A11 antibodies. The immunogen can be any 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50 or more than 50 contiguous amino acids, or the corresponding
nucleic acids that encode them, from the 101P3A11 protein. In particular, peptide immunogens of the invention can
comprise a peptide region of at least 5 amino acids of Figure 2 in any whole number increment up to 317 that
includes an amino acid position having a value greater than 0.5 in the Hydrophilicity profile of Figure 5; a peptide
region of at least 5 amino acids of Figure 2 in any whole number increment up to 317 that includes an amino acid
position having a value less than 0.5 in the Hydropathicity profile of Figure 6; a peptide region of at least 5 amino
acids of Figure 2 in any whole number increment up to 317 that includes an amino acid position having a value
greater than 0.5 in the Percent Accessible Residues profile of Figure 7; a peptide region of at least 5 amino acids of
Figure 2 in any whole number increment up to 317 that includes an amino acid position having a value greater than
0.5 in the Average Flexibility profile of Figure 8; and a peptide region of at least 5 amino acids of Figure 2 in any
whole number increment up to 317 that includes an amino acid position having a value greater than 0.5 in the
Beta-turn profile of Figure 9. Peptide immunogens of the invention can also comprise nucleic acids that encode any of
the foregoing.

All immunogens of the invention, peptide or nucleic acid, can be embodied in human unit dose form, or
comprised by a composition that includes a pharmaceutical excipient compatible with human physiology.

The secondary structure of 101P3A11, namely the presence and location of alpha helices,
extended strands, and random coils, is predicted from the primary amino acid sequence using the HNN -
Hierarchical Neural Network method (Guermeur, 1997,
http://pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_nn.html), accessed from the ExPasy molecular biology server
http://www.expasy.ch/tools/. The analysis indicates that 101P3A11 is composed of 47.95% alpha helix, 21.45%
extended strand, and 30.60% random coil (Figure 19A).
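
The percentage breakdown reported above is a simple tally over per-residue state assignments. The sketch below assumes the prediction is available as a string with one letter per residue (H for helix, E for extended strand, C for coil), which is a common output convention but is not specified in this document. The residue counts in the usage note are illustrative values chosen only because they reproduce the reported percentages for a 317-residue sequence; they are not taken from the source.

```python
from collections import Counter


def composition(states):
    """Return the percentage of each secondary-structure state, rounded to 2 decimals.

    states: string with one state letter per residue (assumed encoding:
    H = alpha helix, E = extended strand, C = random coil).
    """
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    total = len(states)
    return {state: round(100.0 * n / total, 2) for state, n in counts.items()}
```

For example, a hypothetical assignment of 152 helix, 68 strand and 97 coil residues (317 in total) gives 47.95%, 21.45% and 30.6% respectively.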

Analysis for the potential presence of transmembrane domains in 101P3A11 was carried out using a
variety of transmembrane prediction algorithms accessed from the ExPasy molecular biology server
http://www.expasy.ch/tools. The programs predict the presence of 7 transmembrane domains in 101P3A11,
consistent with the structure of a G-protein coupled receptor. Shown graphically in Figure 19A are the results of
analysis using the TMpred (Figure 19B) and TMHMM (Figure 19C) prediction programs depicting the location of
the 7 transmembrane domains. The results of each program, namely the amino acids encoding the transmembrane
domains, are summarized in Table XXV.


Example 5: Generation of 101P3A11 Polyclonal Antibodies 

Polyclonal antibodies can be raised in a mammal, for example, by one or more injections of an immunizing
agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal
by multiple subcutaneous or intraperitoneal injections. In addition to immunizing with the full length 101P3A11
protein, computer algorithms are employed in design of immunogens that, based on amino acid sequence analysis,
are antigenic and available for recognition by the immune system of the immunized host (see the Example entitled
"Antigenicity Profiles and Secondary Structure"). Such regions would generally be hydrophilic, flexible, in beta-turn
conformations, and/or exposed on the surface of the protein (see, e.g., Figure 5, Figure 6, Figure 7, Figure 8, or
Figure 9 for amino acid profiles that indicate such regions of 101P3A11).

For example, 101P3A11 recombinant bacterial fusion proteins or peptides containing hydrophilic, flexible,
beta-turn regions of the 101P3A11 amino acid sequence, such as amino acids 1-23, plus or minus 1-10 amino acids
at available termini, and amino acids 159-202, plus or minus 1-10 amino acids at available termini, are used as
antigens to generate polyclonal antibodies in New Zealand White rabbits. It is useful to conjugate the immunizing
agent to a protein known to be immunogenic in the mammal being immunized. Examples of such immunogenic
proteins include, but are not limited to, keyhole limpet hemocyanin (KLH), serum albumin, bovine thyroglobulin,
and soybean trypsin inhibitor. In one embodiment, a peptide encoding amino acids 1-23 of 101P3A11 is conjugated
to KLH and used to immunize the rabbit. Alternatively, the immunizing agent may include all or portions of the
101P3A11 protein, analogs or fusion proteins thereof. For example, the 101P3A11 amino acid sequence can be
fused using recombinant DNA techniques to any one of a variety of fusion protein partners that are well known in
the art, such as glutathione-S-transferase (GST) and HIS tagged fusion proteins. Such fusion proteins are purified
from induced bacteria using the appropriate affinity matrix.

In one embodiment, a GST-fusion protein encoding amino acids 86-317, plus or minus 1-10 amino acids at
available termini, is produced and purified and used as immunogen. Other recombinant bacterial fusion proteins
that may be employed include maltose binding protein, LacZ, thioredoxin, NusA, or an immunoglobulin constant
region (see the section entitled "Production of 101P3A11 in Prokaryotic Systems" and Current Protocols In
Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubel et al. eds., 1995; Linsley, P.S., Brady, W., Urnes, M.,
Grosmaire, L., Damle, N., and Ledbetter, L. (1991) J. Exp. Med. 174, 561-566).

In addition to bacterial derived fusion proteins, mammalian expressed protein antigens are also used.
These antigens are expressed from mammalian expression vectors such as the Tag5 and Fc-fusion vectors (see the
section entitled "Production of Recombinant 101P3A11 in Eukaryotic Systems"), and retain post-translational
modifications such as glycosylations found in native protein. In one embodiment, amino acids 159-202 are cloned
into the Tag5 mammalian secretion vector. The recombinant protein is purified by metal chelate chromatography
from tissue culture supernatants of 293T cells stably expressing the recombinant vector. The purified Tag5
101P3A11 protein is then used as immunogen.

During the immunization protocol, it is useful to mix or emulsify the antigen in adjuvants that enhance the
immune response of the host animal. Examples of adjuvants include, but are not limited to, complete Freund's
adjuvant (CFA) and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).

In a typical protocol, rabbits are initially immunized subcutaneously with up to 200 µg, typically 100-200
µg, of fusion protein or peptide conjugated to KLH mixed in complete Freund's adjuvant (CFA). Rabbits are then
injected subcutaneously every two weeks with up to 200 µg, typically 100-200 µg, of the immunogen in incomplete
Freund's adjuvant (IFA). Test bleeds are taken approximately 7-10 days following each immunization and used to
monitor the titer of the antiserum by ELISA.

To test reactivity and specificity of immune serum, such as the rabbit serum derived from immunization
with Tag5 101P3A11 encoding amino acids 159-202, the full-length 101P3A11 cDNA is cloned into pCDNA 3.1
myc-his expression vector (Invitrogen, see the Example entitled "Production of Recombinant 101P3A11 in
Eukaryotic Systems"). After transfection of the constructs into 293T cells, cell lysates are probed with the
anti-101P3A11 serum and with anti-His antibody (Santa Cruz Biotechnologies, Santa Cruz, CA) to determine specific
reactivity to denatured 101P3A11 protein using the Western blot technique. Immunoprecipitation and flow
cytometric analyses of 293T and other recombinant 101P3A11-expressing cells determine recognition of native
protein by the antiserum. In addition, Western blot, immunoprecipitation, fluorescent microscopy, and flow
cytometric techniques using cells that endogenously express 101P3A11 are carried out to test specificity.

The anti-serum from the Tag5 101P3A11 immunized rabbit is affinity purified by passage over a column
composed of the Tag5 antigen covalently coupled to Affigel matrix (BioRad, Hercules, Calif.). The serum is then
further purified by protein G affinity chromatography to isolate the IgG fraction. Serum from rabbits immunized
with fusion proteins, such as GST and MBP fusion proteins, is purified by depletion of antibodies reactive to the
fusion partner sequence by passage over an affinity column containing the fusion partner either alone or in the
context of an irrelevant fusion protein. Sera from other His-tagged antigens and peptide immunized rabbits as well
as fusion partner depleted sera are affinity purified by passage over a column matrix composed of the original
protein immunogen or free peptide.

Example 6: Generation of 101P3A11 Monoclonal Antibodies (mAbs)

In one embodiment, therapeutic mAbs to 101P3A11 comprise those that react with epitopes of the protein
that would disrupt or modulate the biological function of 101P3A11, for example those that would disrupt its
interaction with ligands or proteins that mediate or are involved in its biological activity. Therapeutic mAbs also
comprise those that specifically bind epitopes of 101P3A11 exposed on the cell surface and thus are useful in
targeting mAb-toxin conjugates. Immunogens for generation of such mAbs include those designed to encode or
contain the entire 101P3A11 protein or regions of the 101P3A11 protein predicted to be exposed to the extracellular
environment and/or antigenic from computer analysis of the amino acid sequence (see, e.g., Figure 5, Figure 6,
Figure 7, Figure 8, or Figure 9, and the Example entitled "Antigenicity Profiles and Secondary Structure").
Immunogens include peptides, recombinant bacterial proteins, and mammalian expressed Tag5 proteins and human
and murine IgG Fc fusion proteins. In addition, cells expressing high levels of 101P3A11, such as
293T-101P3A11 or 300.19-101P3A11 murine Pre-B cells, are used to immunize mice.

To generate mAbs to 101P3A11, mice are first immunized intraperitoneally (IP) with, typically, 10-50 µg
of protein immunogen or 10^7 101P3A11-expressing cells mixed in complete Freund's adjuvant. Alternatively,
mice are immunized intradermally. Mice are then subsequently immunized IP every 2-4 weeks with, typically,
10-50 µg of protein immunogen or 10^7 cells mixed in incomplete Freund's adjuvant. Alternatively, MPL-TDM
adjuvant is used in immunizations. In addition to the above protein and cell-based immunization strategies, a
DNA-based immunization protocol is employed in which a mammalian expression vector encoding 101P3A11 sequence
is used to immunize mice by direct injection of the plasmid DNA. For example, the predicted first extracellular
loop, amino acids 82-104, or second extracellular loop of 101P3A11, amino acids 159-202, or the third
extracellular loop, amino acids 258-275 (in each instance plus or minus 10 amino acids) is cloned into the Tag5
mammalian secretion vector and the recombinant vector is used as immunogen. In another example the same
amino acids are cloned into an Fc-fusion secretion vector in which the 101P3A11 sequence is fused at the
amino-terminus to an IgK leader sequence and at the carboxyl-terminus to the coding sequence of the human IgG Fc
region. This recombinant vector is then used as immunogen. The plasmid immunization protocols are used in
combination with purified proteins expressed from the same vector and with cells expressing 101P3A11.

During the immunization protocol, test bleeds are taken 7-10 days following an injection to monitor titer
and specificity of the immune response. Once appropriate reactivity and specificity are obtained as determined by
ELISA, Western blotting, immunoprecipitation, fluorescence microscopy, and flow cytometric analyses, fusion and
hybridoma generation is then carried out with established procedures well known in the art (see, e.g., Harlow and
Lane, 1988).

In one embodiment for generating 101P3A11 monoclonal antibodies, a Tag5-101P3A11 antigen encoding
amino acids 159-202 is expressed and purified from stably transfected 293T cells. BALB/c mice are initially
immunized intraperitoneally with 25 µg of the Tag5-101P3A11 protein mixed in complete Freund's adjuvant. Mice
are subsequently immunized every two weeks with 25 µg of the antigen mixed in incomplete Freund's adjuvant for
a total of three immunizations. ELISA using the Tag5 antigen determines the titer of serum from immunized mice.
Reactivity and specificity of serum to full length 101P3A11 protein is monitored by Western blotting,
immunoprecipitation and flow cytometry using 293T cells transfected with an expression vector encoding the
101P3A11 cDNA (see e.g., the Example entitled "Production of Recombinant 101P3A11 in Eukaryotic Systems").
Other recombinant 101P3A11-expressing cells or cells endogenously expressing 101P3A11 are also used. Mice
showing the strongest reactivity are rested and given a final injection of Tag5 antigen in PBS and then sacrificed
four days later. The spleens of the sacrificed mice are harvested and fused to SP2/0 myeloma cells using standard
procedures (Harlow and Lane, 1988). Supernatants from HAT selected growth wells are screened by ELISA,
Western blot, immunoprecipitation, fluorescent microscopy, and flow cytometry to identify 101P3A11 specific
antibody-producing clones.

The binding affinity of a 101P3A11 monoclonal antibody is determined using standard technologies.
Affinity measurements quantify the strength of antibody to epitope binding and are used to help define which
101P3A11 monoclonal antibodies are preferred, e.g., for diagnostic or therapeutic use, as appreciated by one of skill in
the art. The BIAcore system (Uppsala, Sweden) is a preferred method for determining binding affinity. The
BIAcore system uses surface plasmon resonance (SPR, Welford K. 1991, Opt. Quant. Elect. 23:1; Morton and
Myszka, 1998, Methods in Enzymology 295: 268) to monitor biomolecular interactions in real time. BIAcore
analysis conveniently generates association rate constants, dissociation rate constants, equilibrium dissociation
constants, and affinity constants.
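
The four quantities named above are related by standard 1:1 binding kinetics: the equilibrium dissociation constant is the dissociation rate constant divided by the association rate constant, and the affinity constant is its reciprocal. The sketch below illustrates that relationship only; the function name is hypothetical and the 1:1 binding model is an assumption, since the document does not state which fitting model is used.

```python
def affinity_from_rates(k_on, k_off):
    """Derive equilibrium constants from kinetic rate constants (1:1 binding assumed).

    k_on:  association rate constant, in 1/(M*s)
    k_off: dissociation rate constant, in 1/s
    Returns (K_D in M, K_A in 1/M).
    """
    if k_on <= 0 or k_off <= 0:
        raise ValueError("rate constants must be positive")
    k_d = k_off / k_on  # equilibrium dissociation constant
    k_a = k_on / k_off  # affinity (association) constant, reciprocal of K_D
    return k_d, k_a
```

As a worked case with made-up rates, k_on = 1e5 per M per second and k_off = 1e-4 per second correspond to a K_D of about 1 nM.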

Example 7: HLA Class I and Class II Binding Assays 

HLA class I and class II binding assays using purified HLA molecules are performed in accordance with
disclosed protocols (e.g., PCT publications WO 94/20127 and WO 94/03205; Sidney et al., Current Protocols in
Immunology 18.3.1 (1998); Sidney et al., J. Immunol. 154:247 (1995); Sette et al., Mol. Immunol. 31:813 (1994)).
Briefly, purified MHC molecules (5 to 500 nM) are incubated with various unlabeled peptide inhibitors and 1-10
nM 125I-radiolabeled probe peptides as described. Following incubation, MHC-peptide complexes are separated
from free peptide by gel filtration and the fraction of peptide bound is determined. Typically, in preliminary
experiments, each MHC preparation is titered in the presence of fixed amounts of radiolabeled peptides to
determine the concentration of HLA molecules necessary to bind 10-20% of the total radioactivity. All subsequent
inhibition and direct binding assays are performed using these HLA concentrations.

Since under these conditions [label]<[HLA] and IC50>[HLA], the measured IC50 values are reasonable
approximations of the true KD values. Peptide inhibitors are typically tested at concentrations ranging from 120
µg/ml to 1.2 ng/ml, and are tested in two to four completely independent experiments. To allow comparison of the
data obtained in different experiments, a relative binding figure is calculated for each peptide by dividing the IC50 of
a positive control for inhibition (typically an unlabeled version of the radiolabeled probe peptide) by the IC50 for
each tested peptide. For database purposes, and inter-experiment comparisons, relative binding values are compiled.
These values can subsequently be converted back into IC50 nM values by dividing the IC50 nM of the positive
controls for inhibition by the relative binding of the peptide of interest. This method of data compilation is accurate
and consistent for comparing peptides that have been tested on different days, or with different lots of purified
MHC.
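
The relative binding calculation and its inverse, as described above, amount to two divisions. The following sketch uses hypothetical function names and made-up IC50 values purely to show the arithmetic.

```python
def relative_binding(ic50_control_nm, ic50_peptide_nm):
    """Relative binding = IC50 of the positive control / IC50 of the tested peptide."""
    if ic50_control_nm <= 0 or ic50_peptide_nm <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_control_nm / ic50_peptide_nm


def ic50_from_relative_binding(ic50_control_nm, relative):
    """Convert a compiled relative binding value back into an IC50 in nM."""
    if ic50_control_nm <= 0 or relative <= 0:
        raise ValueError("inputs must be positive")
    return ic50_control_nm / relative
```

A peptide with an IC50 ten times that of the control has a relative binding of 0.1; dividing the control IC50 measured in a later experiment by 0.1 recovers a comparable IC50 for that peptide.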

Binding assays as outlined above may be used to analyze HLA supermotif and/or HLA motif-bearing
peptides.

Example 8: Identification of HLA Supermotif- and Motif-Bearing CTL Candidate Epitopes 

HLA vaccine compositions of the invention can include multiple epitopes. The multiple epitopes can
comprise multiple HLA supermotifs or motifs to achieve broad population coverage. This example illustrates the
identification and confirmation of supermotif- and motif-bearing epitopes for the inclusion in such a vaccine
composition. Calculation of population coverage is performed using the strategy described below.

Computer searches and algorithms for identification of supermotif and/or motif-bearing epitopes 
The searches performed to identify the motif-bearing peptide sequences in the Example entitled
"Antigenicity Profiles" and Tables V-XVIII or XXVI-XXVIII employ the protein sequence data from the gene
product of 101P3A11 set forth in Figures 2 and 3.

Computer searches for epitopes bearing HLA Class I or Class II supermotifs or motifs are performed as
follows. All translated 101P3A11 protein sequences are analyzed using a text string search software program to
identify potential peptide sequences containing appropriate HLA binding motifs; such programs are readily
produced in accordance with information in the art in view of known motif/supermotif disclosures. Furthermore,
such calculations can be made mentally.
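
A text string search of the kind described above can be sketched in a few lines. The anchor residue sets shown for the A2 supermotif (position 2 and C-terminus) are stated here as an assumption based on commonly published definitions; the authoritative definitions are those in the motif tables of this document. The function name is hypothetical.

```python
# Assumed A2-supermotif primary anchors (check against the document's motif tables).
A2_POSITION_2 = set("LIVMATQ")
A2_C_TERMINUS = set("IVMATL")


def scan_supermotif(sequence, position_2, c_terminus, lengths=(8, 9, 10, 11)):
    """Return (start position, peptide) for every window bearing both anchors.

    Start positions are 1-based. A window qualifies when its second residue
    is in position_2 and its last residue is in c_terminus.
    """
    hits = []
    for length in lengths:
        for start in range(len(sequence) - length + 1):
            peptide = sequence[start:start + length]
            if peptide[1] in position_2 and peptide[-1] in c_terminus:
                hits.append((start + 1, peptide))
    return hits
```

The same function serves other supermotifs by passing different anchor sets, which is why the anchors are parameters rather than constants inside the loop.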
Identified A2-, A3-, and DR-supermotif sequences are scored using polynomial algorithms to predict their
capacity to bind to specific HLA-Class I or Class II molecules. These polynomial algorithms account for the impact
of different amino acids at different positions, and are essentially based on the premise that the overall affinity (or
ΔG) of peptide-HLA molecule interactions can be approximated as a linear polynomial function of the type:

$$"\Delta G" = a_{1i} \times a_{2i} \times a_{3i} \times \cdots \times a_{ni}$$

where $a_{ji}$ is a coefficient which represents the effect of the presence of a given amino acid ($j$) at a given position ($i$)
along the sequence of a peptide of n amino acids. The crucial assumption of this method is that the effects at each
position are essentially independent of each other (i.e., independent binding of individual side-chains). When
residue $j$ occurs at position $i$ in the peptide, it is assumed to contribute a constant amount $j_i$ to the free energy of
binding of the peptide irrespective of the sequence of the rest of the peptide.

The method of derivation of specific algorithm coefficients has been described in Gulukota et al., J. Mol.
Biol. 267:1258-126, 1997; (see also Sidney et al., Human Immunol. 45:79-93, 1996; and Southwood et al., J.
Immunol. 160:3363-3373, 1998). Briefly, for all i positions, anchor and non-anchor alike, the geometric mean of
the average relative binding (ARB) of all peptides carrying j is calculated relative to the remainder of the group, and
used as the estimate of $j_i$. For Class II peptides, if multiple alignments are possible, only the highest scoring
alignment is utilized, following an iterative procedure. To calculate an algorithm score of a given peptide in a test
set, the ARB values corresponding to the sequence of the peptide are multiplied. If this product exceeds a chosen
threshold, the peptide is predicted to bind. Appropriate thresholds are chosen as a function of the degree of
stringency of prediction desired.
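
The scoring step described above (multiply the position-specific coefficients, then compare the product with a threshold) can be sketched as follows. The coefficient table in the usage note is entirely hypothetical, and treating a missing coefficient as a neutral factor of 1.0 is an assumption of this sketch, not something the cited methods specify.

```python
from math import prod


def algorithm_score(peptide, coefficients, default=1.0):
    """Multiply the coefficient for each (position, residue) pair along the peptide.

    coefficients: dict mapping (1-based position, residue letter) to an
    ARB-derived coefficient. Pairs absent from the table contribute
    `default` (assumed neutral).
    """
    return prod(
        coefficients.get((position, residue), default)
        for position, residue in enumerate(peptide, start=1)
    )


def predicted_binder(peptide, coefficients, threshold):
    """A peptide is predicted to bind when its score exceeds the chosen threshold."""
    return algorithm_score(peptide, coefficients) > threshold
```

For instance, with hypothetical coefficients of 4.0 for L at position 2, 2.0 for V at position 9 and 0.5 for P at position 1, the peptide GLGGGGGGV scores 8.0 and PLGGGGGGV scores 4.0, so a threshold of 5.0 would retain only the first. Raising the threshold makes the prediction more stringent.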

Selection of HLA-A2 supertype cross-reactive peptides 

Complete protein sequences from 101P3A11 are scanned utilizing motif identification software, to identify
8-, 9-, 10- and 11-mer sequences containing the HLA-A2-supermotif main anchor specificity. Typically, these
sequences are then scored using the protocol described above and the peptides corresponding to the positive-scoring
sequences are synthesized and tested for their capacity to bind purified HLA-A*0201 molecules in vitro
(HLA-A*0201 is considered a prototype A2 supertype molecule).

These peptides are then tested for the capacity to bind to additional A2-supertype molecules (A*0202,
A*0203, A*0206, and A*6802). Peptides that bind to at least three of the five A2-supertype alleles tested are
typically deemed A2-supertype cross-reactive binders. Preferred peptides bind at an affinity equal to or less than
500 nM to three or more HLA-A2 supertype molecules.
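
The cross-reactivity criterion above (binding at 500 nM or less to at least three of the five supertype alleles tested) can be expressed as a simple count. The function name and the IC50 values in the usage note are hypothetical.

```python
def is_cross_reactive(ic50_by_allele, cutoff_nm=500.0, min_alleles=3):
    """Return True when the peptide binds at least `min_alleles` alleles.

    ic50_by_allele: dict mapping allele name to measured IC50 in nM.
    An allele counts as bound when its IC50 is equal to or less than the cutoff.
    """
    bound = [allele for allele, ic50 in ic50_by_allele.items() if ic50 <= cutoff_nm]
    return len(bound) >= min_alleles
```

A peptide with made-up IC50 values of 20, 450 and 500 nM on three alleles and values above 500 nM on the other two would be classed as a cross-reactive binder, since exactly three alleles meet the cutoff.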

Selection of HLA-A3 supermotif-bearing epitopes 

The 101P3A11 protein sequence scanned above is also examined for the presence of peptides with the
HLA-A3-supermotif primary anchors. Peptides corresponding to the HLA A3 supermotif-bearing sequences are
then synthesized and tested for binding to HLA-A*0301 and HLA-A*1101 molecules, the molecules encoded by
the two most prevalent A3-supertype alleles. The peptides that bind at least one of the two alleles with binding
affinities of <500 nM, often <200 nM, are then tested for binding cross-reactivity to the other common
A3-supertype alleles (e.g., A*3101, A*3301, and A*6801) to identify those that can bind at least three of the five
HLA-A3-supertype molecules tested.

Selection of HLA-B7 supermotif bearing epitopes 

The 101P3A11 protein is also analyzed for the presence of 8-, 9-, 10-, or 11-mer peptides with the
HLA-B7-supermotif. Corresponding peptides are synthesized and tested for binding to HLA-B*0702, the molecule
encoded by the most common B7-supertype allele (i.e., the prototype B7 supertype allele). Peptides binding
B*0702 with IC50 of <500 nM are identified using standard methods. These peptides are then tested for binding to
other common B7-supertype molecules (e.g., B*3501, B*5101, B*5301, and B*5401). Peptides capable of binding
to three or more of the five B7-supertype alleles tested are thereby identified.

Selection of A1 and A24 motif-bearing epitopes

To further increase population coverage, HLA-A1 and -A24 epitopes can also be incorporated into vaccine
compositions. An analysis of the 101P3A11 protein can also be performed to identify HLA-A1- and
A24-motif-containing sequences.

High affinity and/or cross-reactive binding epitopes that bear other motif and/or supermotifs are identified 
using analogous methodology. 

Example 9: Confirmation of Immunogenicity

Cross-reactive candidate CTL A2-supermotif-bearing peptides that are identified as described herein are
selected to confirm in vitro immunogenicity. Confirmation is performed using the following methodology:

Target Cell Lines for Cellular Screening:

The .221A2.1 cell line, produced by transferring the HLA-A2.1 gene into the HLA-A, -B, -C null mutant
human B-lymphoblastoid cell line 721.221, is used as the peptide-loaded target to measure activity of
HLA-A2.1-restricted CTL. This cell line is grown in RPMI-1640 medium supplemented with antibiotics, sodium pyruvate,
nonessential amino acids and 10% (v/v) heat inactivated FCS. Cells that express an antigen of interest, or
transfectants comprising the gene encoding the antigen of interest, can be used as target cells to confirm the ability
of peptide-specific CTLs to recognize endogenous antigen.

Primary CTL Induction Cultures:

Generation of Dendritic Cells (DC): PBMCs are thawed in RPMI with 30 µg/ml DNAse, washed twice
and resuspended in complete medium (RPMI-1640 plus 5% AB human serum, non-essential amino acids, sodium
pyruvate, L-glutamine and penicillin/streptomycin). The monocytes are purified by plating 10 x 10^6 PBMC/well in
a 6-well plate. After 2 hours at 37°C, the non-adherent cells are removed by gently shaking the plates and
aspirating the supernatants. The wells are washed a total of three times with 3 ml RPMI to remove most of the
non-adherent and loosely adherent cells. Three ml of complete medium containing 50 ng/ml of GM-CSF and 1,000 U/ml
of IL-4 are then added to each well. TNFα is added to the DCs on day 6 at 75 ng/ml and the cells are used for CTL
induction cultures on day 7.

Induction of CTL with DC and Peptide: CD8+ T-cells are isolated by positive selection with Dynal
immunomagnetic beads (Dynabeads® M-450) and the detacha-bead® reagent. Typically about 200-250 x 10^6
PBMC are processed to obtain 24 x 10^6 CD8+ T-cells (enough for a 48-well plate culture). Briefly, the PBMCs are
thawed in RPMI with 30 µg/ml DNAse, washed once with PBS containing 1% human AB serum and resuspended in
PBS/1% AB serum at a concentration of 20 x 10^6 cells/ml. The magnetic beads are washed 3 times with PBS/AB
serum, added to the cells (140 µl beads/20 x 10^6 cells) and incubated for 1 hour at 4°C with continuous mixing. The
beads and cells are washed 4x with PBS/AB serum to remove the nonadherent cells and resuspended at 100 x 10^6
cells/ml (based on the original cell number) in PBS/AB serum containing 100 µl/ml detacha-bead® reagent and 30
µg/ml DNAse. The mixture is incubated for 1 hour at room temperature with continuous mixing. The beads are
washed again with PBS/AB/DNAse to collect the CD8+ T-cells. The DC are collected and centrifuged at 1300 rpm
for 5-7 minutes, washed once with PBS with 1% BSA, counted and pulsed with 40 ng/ml of peptide at a cell
concentration of 1-2 x 10^6/ml in the presence of 3 µg/ml β2-microglobulin for 4 hours at 20°C. The DC are then
irradiated (4,200 rads), washed 1 time with medium and counted again.

Setting up induction cultures: 0.25 ml cytokine-generated DC (at 1 x 10^5 cells/ml) are co-cultured with
0.25 ml of CD8+ T-cells (at 2 x 10^6 cells/ml) in each well of a 48-well plate in the presence of 10 ng/ml of IL-7.
Recombinant human IL-10 is added the next day at a final concentration of 10 ng/ml and recombinant human IL-2 is
added 48 hours later at 10 IU/ml.

Restimulation of the induction cultures with peptide-pulsed adherent cells: Seven and fourteen days after
the primary induction, the cells are restimulated with peptide-pulsed adherent cells. The PBMCs are thawed and
washed twice with RPMI and DNAse. The cells are resuspended at 5 x 10^6 cells/ml and irradiated at ~4200 rads.
The PBMCs are plated at 2 x 10^6 in 0.5 ml complete medium per well and incubated for 2 hours at 37°C. The plates
are washed twice with RPMI by tapping the plate gently to remove the nonadherent cells and the adherent cells
are pulsed with 10 µg/ml of peptide in the presence of 3 µg/ml β2-microglobulin in 0.25 ml RPMI/5% AB per well for 2
hours at 37°C. Peptide solution from each well is aspirated and the wells are washed once with RPMI. Most of the
media is aspirated from the induction cultures (CD8+ cells) and brought to 0.5 ml with fresh media. The cells are
then transferred to the wells containing the peptide-pulsed adherent cells. Twenty four hours later recombinant
human IL-10 is added at a final concentration of 10 ng/ml and recombinant human IL-2 is added the next day and
again 2-3 days later at 50 IU/ml (Tsai et al., Critical Reviews in Immunology 18(1-2):65-75, 1998). Seven days
later, the cultures are assayed for CTL activity in a 51Cr release assay. In some experiments the cultures are assayed
for peptide-specific recognition in the in situ IFNγ ELISA at the time of the second restimulation followed by assay
of endogenous recognition 7 days later. After expansion, activity is measured in both assays for a side-by-side
comparison.

Measurement of CTL lytic activity by 51Cr release.

Seven days after the second restimulation, cytotoxicity is determined in a standard (5 hr) 51Cr release assay
by assaying individual wells at a single E:T. Peptide-pulsed targets are prepared by incubating the cells with
10 µg/ml peptide overnight at 37°C.

Adherent target cells are removed from culture flasks with trypsin-EDTA. Target cells are labeled with
200 µCi of 51Cr sodium chromate (Dupont, Wilmington, DE) for 1 hour at 37°C. Labeled target cells are
resuspended at 10^6 per ml and diluted 1:10 with K562 cells at a concentration of 3.3 x 10^6/ml (an NK-sensitive
erythroblastoma cell line used to reduce non-specific lysis). Target cells (100 µl) and effectors (100 µl) are plated in
96 well round-bottom plates and incubated for 5 hours at 37°C. At that time, 100 µl of supernatant are collected
from each well and percent lysis is determined according to the formula:

[(cpm of the test sample - cpm of the spontaneous 51Cr release sample)/(cpm of the maximal 51Cr release sample -
cpm of the spontaneous 51Cr release sample)] x 100.

Maximum and spontaneous release are determined by incubating the labeled targets with 1% Triton X-100
and media alone, respectively. A positive culture is defined as one in which the specific lysis (sample -
background) is 10% or higher in the case of individual wells and is 15% or more at the two highest E:T ratios when
expanded cultures are assayed.
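
The percent lysis formula and the individual-well positivity rule above can be written out directly. In this sketch the function names are hypothetical, and "background" is taken to mean the percent lysis measured against control targets, which is an interpretation of the text rather than something it states explicitly.

```python
def percent_lysis(test_cpm, spontaneous_cpm, maximal_cpm):
    """[(test - spontaneous) / (maximal - spontaneous)] x 100."""
    denominator = maximal_cpm - spontaneous_cpm
    if denominator <= 0:
        raise ValueError("maximal release must exceed spontaneous release")
    return 100.0 * (test_cpm - spontaneous_cpm) / denominator


def positive_individual_well(sample_lysis, background_lysis, cutoff=10.0):
    """Individual wells are positive when (sample - background) is 10% or higher."""
    return (sample_lysis - background_lysis) >= cutoff
```

With made-up counts of 1500 cpm for the test well, 500 cpm spontaneous release and 4500 cpm maximal release, the formula gives 25% lysis.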

In situ Measurement of Human IFNγ Production as an Indicator of Peptide-specific and Endogenous Recognition

Immulon 2 plates are coated with mouse anti-human IFNγ monoclonal antibody (4 µg/ml in 0.1 M NaHCO3,
pH 8.2) overnight at 4°C. The plates are washed with Ca2+, Mg2+-free PBS/0.05% Tween 20 and blocked with
PBS/10% FCS for two hours, after which the CTLs (100 µl/well) and targets (100 µl/well) are added to each well,
leaving empty wells for the standards and blanks (which receive media only). The target cells, either
peptide-pulsed or endogenous targets, are used at a concentration of 1 x 10^6 cells/ml. The plates are incubated for 48 hours at
37°C with 5% CO2.

Recombinant human IFN-gamma is added to the standard wells starting at 400 pg or 1200 pg/100
microliter/well and the plate incubated for two hours at 37°C. The plates are washed and 100 microliter of biotinylated
mouse anti-human IFN-gamma monoclonal antibody (2 microgram/ml in PBS/3%FCS/0.05% Tween 20) are added
and incubated for 2 hours at room temperature. After washing again, 100 microliter HRP-streptavidin (1:4000) are
added and the plates incubated for one hour at room temperature. The plates are then washed 6x with wash buffer,
100 microliter/well developing solution (TMB 1:1) are added, and the plates allowed to develop for 5-15 minutes.
The reaction is stopped with 50 microliter/well 1M H3PO4 and read at OD450. A culture is considered positive if it
measures at least 50 pg of IFN-gamma/well above background and is twice the background level of expression.

CTL Expansion.

Those cultures that demonstrate specific lytic activity against peptide-pulsed targets and/or tumor targets
are expanded over a two week period with anti-CD3. Briefly, 5 x 10^4 CD8+ cells are added to a T25 flask containing
the following: 1 x 10^6 irradiated (4,200 rad) PBMC (autologous or allogeneic) per ml, 2 x 10^5 irradiated (8,000 rad)
EBV-transformed cells per ml, and OKT3 (anti-CD3) at 30 ng per ml in RPMI-1640 containing 10% (v/v) human
AB serum, non-essential amino acids, sodium pyruvate, 25 µM 2-mercaptoethanol, L-glutamine and
penicillin/streptomycin. Recombinant human IL2 is added 24 hours later at a final concentration of 200 IU/ml and
every three days thereafter with fresh media at 50 IU/ml. The cells are split if the cell concentration exceeds
1 x 10^6/ml and the cultures are assayed between days 13 and 15 at E:T ratios of 30, 10, 3 and 1:1 in the 51Cr release
assay or at 1 x 10^6/ml in the in situ IFNγ assay using the same targets as before the expansion.

Cultures are expanded in the absence of anti-CD3 as follows. Those cultures that demonstrate specific
lytic activity against peptide and endogenous targets are selected and 5 x 10^4 CD8+ cells are added to a T25 flask
containing the following: 1 x 10^6 autologous PBMC per ml which have been peptide-pulsed with 10 µg/ml peptide
for two hours at 37°C and irradiated (4,200 rad); 2 x 10^5 irradiated (8,000 rad) EBV-transformed cells per ml RPMI-1640
containing 10% (v/v) human AB serum, non-essential AA, sodium pyruvate, 25mM 2-ME, L-glutamine and
gentamicin.

Immunogenicity of A2 supermotif-bearing peptides

A2-supermotif cross-reactive binding peptides are tested in the cellular assay for the ability to induce 
peptide-specific CTL in normal individuals. In this analysis, a peptide is typically considered to be an epitope if it 
induces peptide-specific CTLs in at least individuals, and preferably, also recognizes the endogenously expressed
peptide. 

Immunogenicity can also be confirmed using PBMCs isolated from patients bearing a tumor that expresses
101P3A11. Briefly, PBMCs are isolated from patients, re-stimulated with peptide-pulsed monocytes and assayed
for the ability to recognize peptide-pulsed target cells as well as transfected cells endogenously expressing the
antigen.

Evaluation of A*03/A11 immunogenicity

HLA-A3 supermotif-bearing cross-reactive binding peptides are also evaluated for immunogenicity using
methodology analogous to that used to evaluate the immunogenicity of the HLA-A2 supermotif peptides.

Evaluation of B7 immunogenicity

Immunogenicity screening of the B7-supertype cross-reactive binding peptides identified as set forth
herein is confirmed in a manner analogous to the confirmation of A2- and A3-supermotif-bearing peptides.

Peptides bearing other supermotifs/motifs, e.g., HLA-A1, HLA-A24 etc. are also confirmed using similar
methodology.

Example 10: Implementation of the Extended Supermotif to Improve the Binding Capacity of Native Epitopes
by Creating Analogs

HLA motifs and supermotifs (comprising primary and/or secondary residues) are useful in the 
identification and preparation of highly cross-reactive native peptides, as demonstrated herein. Moreover, the 
definition of HLA motifs and supermotifs also allows one to engineer highly cross-reactive epitopes by identifying 
residues within a native peptide sequence which can be analoged to confer upon the peptide certain characteristics,
e.g. greater cross-reactivity within the group of HLA molecules that comprise a supertype, and/or greater binding 
affinity for some or all of those HLA molecules. Examples of analoging peptides to exhibit modulated binding 
affinity are set forth in this example. 

Analoging at Primary Anchor Residues

Peptide engineering strategies are implemented to further increase the cross-reactivity of the epitopes. For
example, the main anchors of A2-supermotif-bearing peptides are altered, for example, to introduce a preferred L, I,
V, or M at position 2, and I or V at the C-terminus.

To analyze the cross-reactivity of the analog peptides, each engineered analog is initially tested for binding
to the prototype A2 supertype allele A*0201, then, if A*0201 binding capacity is maintained, for A2-supertype
cross-reactivity.

Alternatively, a peptide is confirmed as binding one or all supertype members and then analogued to
modulate binding affinity to any one (or more) of the supertype members to add population coverage.

The selection of analogs for immunogenicity in a cellular screening analysis is typically further restricted
by the capacity of the parent wild type (WT) peptide to bind at least weakly, i.e., bind at an IC50 of 5000 nM or less,
to three or more A2 supertype alleles. The rationale for this requirement is that the WT peptides must be present
endogenously in sufficient quantity to be biologically relevant. Analoged peptides have been shown to have
increased immunogenicity and cross-reactivity by T cells specific for the parent epitope (see, e.g., Parkhurst et al.,
J. Immunol. 157:2539, 1996; and Pogue et al., Proc. Natl. Acad. Sci. USA 92:8166, 1995).

In the cellular screening of these peptide analogs, it is important to confirm that analog-specific CTLs are 
also able to recognize the wild-type peptide and, when possible, target cells that endogenously express the epitope.

Analoging of HLA-A3 and B7-supermotif-bearing peptides 

Analogs of HLA-A3 supermotif-bearing epitopes are generated using strategies similar to those employed
in analoging HLA-A2 supermotif-bearing peptides. For example, peptides binding to 3/5 of the A3-supertype
molecules are engineered at primary anchor residues to possess a preferred residue (V, S, M, or A) at position 2.
The analog peptides are then tested for the ability to bind A*03 and A*11 (prototype A3 supertype alleles).
Those peptides that demonstrate < 500 nM binding capacity are then confirmed as having A3-supertype
cross-reactivity.

Similarly to the A2- and A3- motif bearing peptides, peptides binding 3 or more B7-supertype alleles can 
be improved, where possible, to achieve increased cross-reactive binding or greater binding affinity or binding half 
life. B7 supermotif-bearing peptides are, for example, engineered to possess a preferred residue (V, I, L, or F) at
the C-terminal primary anchor position, as demonstrated by Sidney et al (J. Immunol 157:3480-3490, 1996). 

Analoging at primary anchor residues of other motif and/or supermotif-bearing epitopes is performed in a 
like manner. 

The analog peptides are then confirmed for immunogenicity, typically in a cellular screening assay.
Again, it is generally important to demonstrate that analog-specific CTLs are also able to recognize the wild-type
peptide and, when possible, targets that endogenously express the epitope.

Analoging at Secondary Anchor Residues

Moreover, HLA supermotifs are of value in engineering highly cross-reactive peptides and/or peptides that
bind HLA molecules with increased affinity by identifying particular residues at secondary anchor positions that are
associated with such properties. For example, the binding capacity of a B7 supermotif-bearing peptide with an F
residue at position 1 is analyzed. The peptide is then analoged to, for example, substitute L for F at position 1. The
analoged peptide is evaluated for increased binding affinity, binding half life and/or increased cross-reactivity.
Such a procedure identifies analoged peptides with enhanced properties.

Engineered analogs with sufficiently improved binding capacity or cross-reactivity can also be tested for
immunogenicity in HLA-B7-transgenic mice, following, for example, IFA immunization or lipopeptide
immunization. Analogued peptides are additionally tested for the ability to stimulate a recall response using PBMC
from patients with 101P3A11-expressing tumors.

Other analoguing strategies 

Another form of peptide analoguing, unrelated to anchor positions, involves the substitution of a cysteine
with α-amino butyric acid. Due to its chemical nature, cysteine has the propensity to form disulfide bridges and
sufficiently alter the peptide structurally so as to reduce binding capacity. Substitution of α-amino butyric acid for
cysteine not only alleviates this problem, but has been shown to improve binding and crossbinding capabilities in
some instances (see, e.g., the review by Sette et al., In: Persistent Viral Infections, Eds. R. Ahmed and I. Chen, John
Wiley & Sons, England, 1999).

Thus, by the use of single amino acid substitutions, the binding properties and/or cross-reactivity of 
peptide ligands for HLA supertype molecules can be modulated.

Example 11: Identification and confirmation of 101P3A11-derived sequences with HLA-DR binding motifs

Peptide epitopes bearing an HLA class II supermotif or motif are identified and confirmed as outlined
below using methodology similar to that described for HLA Class I peptides.

Selection of HLA-DR-supermotif-bearing epitopes

To identify 101P3A11-derived, HLA class II HTL epitopes, the 101P3A11 antigen is analyzed for the
presence of sequences bearing an HLA-DR-motif or supermotif. Specifically, 15-mer sequences are selected
comprising a DR-supermotif, comprising a 9-mer core, and three-residue N- and C-terminal flanking regions (15
amino acids total).
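
The 15-mer selection described above (a 9-mer core with three-residue flanks on each side) can be sketched as a sliding window over the antigen sequence. The anchor residue sets below are hypothetical placeholders supplied only so the sketch runs; they are not the supermotif definition used in this specification, and the toy sequence is likewise invented.

```python
# Hypothetical anchor sets for illustration only (assumption, not from the text).
EXAMPLE_P1_ANCHORS = set("FWYVILM")
EXAMPLE_P6_ANCHORS = set("STCAPVILM")


def candidate_15mers(protein, p1_anchors=EXAMPLE_P1_ANCHORS,
                     p6_anchors=EXAMPLE_P6_ANCHORS):
    """Return (start, 15-mer, 9-mer core) for each window whose core carries
    the chosen residues at core position 1 and core position 6."""
    flank, core_len = 3, 9
    total = core_len + 2 * flank  # 15 amino acids total
    hits = []
    for start in range(len(protein) - total + 1):
        window = protein[start:start + total]
        core = window[flank:flank + core_len]
        if core[0] in p1_anchors and core[5] in p6_anchors:
            hits.append((start, window, core))
    return hits


# Toy sequence: F lands on core position 1 and S on core position 6.
print(candidate_15mers("GGGFGGGGSGGGGGG"))
```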

Protocols for predicting peptide binding to DR molecules have been developed (Southwood et al., J.
Immunol. 160:3363-3373, 1998). These protocols, specific for individual DR molecules, allow the scoring, and
ranking, of 9-mer core regions. Each protocol not only scores peptide sequences for the presence of DR-supermotif
primary anchors (i.e., at position 1 and position 6) within a 9-mer core, but additionally evaluates sequences for the
presence of secondary anchors. Using allele-specific selection tables (see, e.g., Southwood et al., ibid.), it has been
found that these protocols efficiently select peptide sequences with a high probability of binding a particular DR
molecule. Additionally, it has been found that performing these protocols in tandem, specifically those for DR1,
DR4w4, and DR7, can efficiently select DR cross-reactive peptides.

The 101P3A11-derived peptides identified above are tested for their binding capacity for various common
HLA-DR molecules. All peptides are initially tested for binding to the DR molecules in the primary panel: DR1,
DR4w4, and DR7. Peptides binding at least two of these three DR molecules are then tested for binding to DR2w2
β1, DR2w2 β2, DR6w19, and DR9 molecules in secondary assays. Finally, peptides binding at least two of the
four secondary panel DR molecules, and thus cumulatively at least four of seven different DR molecules, are
screened for binding to DR4w15, DR5w11, and DR8w2 molecules in tertiary assays. Peptides binding at least
seven of the ten DR molecules comprising the primary, secondary, and tertiary screening assays are considered
cross-reactive DR binders. 101P3A11-derived peptides found to bind common HLA-DR alleles are of particular
interest.
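
The tiered screening logic above can be expressed as a short decision function. This is a sketch under stated assumptions: the function name and return labels are introduced here for illustration, and the input is simply the set of DR molecules a peptide was found to bind (the binding threshold itself is not modeled).

```python
PRIMARY = ("DR1", "DR4w4", "DR7")
SECONDARY = ("DR2w2 beta1", "DR2w2 beta2", "DR6w19", "DR9")
TERTIARY = ("DR4w15", "DR5w11", "DR8w2")


def classify_dr_binder(bound):
    """Apply the primary -> secondary -> tertiary cascade to a set of bound DR names."""
    n_primary = sum(m in bound for m in PRIMARY)
    if n_primary < 2:
        return "stopped after primary panel"
    n_secondary = sum(m in bound for m in SECONDARY)
    if n_secondary < 2:
        return "stopped after secondary panel"
    n_tertiary = sum(m in bound for m in TERTIARY)
    total = n_primary + n_secondary + n_tertiary
    # At least seven of the ten DR molecules defines a cross-reactive DR binder.
    return "cross-reactive DR binder" if total >= 7 else "not cross-reactive"


example = {"DR1", "DR7", "DR2w2 beta1", "DR9", "DR4w15", "DR5w11", "DR8w2"}
print(classify_dr_binder(example))
```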

Selection of DR3 motif peptides 

Because HLA-DR3 is an allele that is prevalent in Caucasian, Black, and Hispanic populations, DR3
binding capacity is a relevant criterion in the selection of HTL epitopes. Thus, peptides shown to be candidates
may also be assayed for their DR3 binding capacity. However, in view of the binding specificity of the DR3 motif,
peptides binding only to DR3 can also be considered as candidates for inclusion in a vaccine formulation.

To efficiently identify peptides that bind DR3, target 101P3A11 antigens are analyzed for sequences
carrying one of the two DR3-specific binding motifs reported by Geluk et al. (J. Immunol. 152:5742-5748, 1994).
The corresponding peptides are then synthesized and confirmed as having the ability to bind DR3 with an affinity of
1 µM or better, i.e., less than 1 µM. Peptides are found that meet this binding criterion and qualify as HLA class II
high affinity binders.

DR3 binding epitopes identified in this manner are included in vaccine compositions with DR supermotif- 
bearing peptide epitopes. 

Similarly to the case of HLA class I motif-bearing peptides, the class II motif-bearing peptides are
analoged to improve affinity or cross-reactivity. For example, aspartic acid at position 4 of the 9-mer core sequence
is an optimal residue for DR3 binding, and substitution for that residue often improves DR3 binding.

Example 12: Immunogenicity of 101P3A11-derived HTL epitopes

This example determines immunogenic DR supermotif- and DR3 motif-bearing epitopes among those
identified using the methodology set forth herein.

Immunogenicity of HTL epitopes is confirmed in a manner analogous to the determination of
immunogenicity of CTL epitopes, by assessing the ability to stimulate HTL responses and/or by using appropriate
transgenic mouse models. Immunogenicity is determined by screening for: 1.) in vitro primary induction using
normal PBMC or 2.) recall responses from patients who have 101P3A11-expressing tumors.

Example 13: Calculation of phenotypic frequencies of HLA-supertypes in various ethnic
backgrounds to determine breadth of population coverage 

This example illustrates the assessment of the breadth of population coverage of a vaccine composition
comprised of multiple epitopes comprising multiple supermotifs and/or motifs.

In order to analyze population coverage, gene frequencies of HLA alleles are determined. Gene
frequencies for each HLA allele are calculated from antigen or allele frequencies utilizing the binomial distribution
formulae gf = 1 - (SQRT(1 - af)) (see, e.g., Sidney et al., Human Immunol. 45:79-93, 1996). To obtain overall
phenotypic frequencies, cumulative gene frequencies are calculated, and the cumulative antigen frequencies derived
by the use of the inverse formula [af = 1 - (1 - Cgf)^2].

Where frequency data is not available at the level of DNA typing, correspondence to the serologically
defined antigen frequencies is assumed. To obtain total potential supertype population coverage no linkage
disequilibrium is assumed, and only alleles confirmed to belong to each of the supertypes are included (minimal
estimates). Estimates of total potential coverage achieved by inter-loci combinations are made by adding to the A
coverage the proportion of the non-A covered population that could be expected to be covered by the B alleles
considered (e.g., total = A + B*(1 - A)). Confirmed members of the A3-like supertype are A3, A11, A31, A*3301, and
A*6801. Although the A3-like supertype may also include A34, A66, and A*7401, these alleles were not included
in overall frequency calculations. Likewise, confirmed members of the A2-like supertype family are A*0201,
A*0202, A*0203, A*0204, A*0205, A*0206, A*0207, A*6802, and A*6901. Finally, the B7-like supertype-confirmed
alleles are: B7, B*3501-03, B51, B*5301, B*5401, B*5501-2, B*5601, B*6701, and B*7801
(potentially also B*1401, B*3504-06, B*4201, and B*5602).

Population coverage achieved by combining the A2-, A3- and B7-supertypes is approximately 86% in five
major ethnic groups. Coverage may be extended by including peptides bearing the A1 and A24 motifs. On
average, A1 is present in 12% and A24 in 29% of the population across five different major ethnic groups
(Caucasian, North American Black, Chinese, Japanese, and Hispanic). Together, these alleles are represented with
an average frequency of 39% in these same ethnic populations. The total coverage across the major ethnicities
when A1 and A24 are combined with the coverage of the A2-, A3- and B7-supertype alleles is >95%. An
analogous approach can be used to estimate population coverage achieved with combinations of class II motif-
bearing epitopes.
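
The coverage arithmetic of this Example (gf = 1 - SQRT(1 - af), af = 1 - (1 - Cgf)^2, and total = A + B*(1 - A)) can be sketched as below. The function names and the numeric inputs are illustrative assumptions and are not actual allele frequency data.

```python
import math


def gene_frequency(af):
    """gf = 1 - SQRT(1 - af), from an antigen (phenotypic) frequency."""
    return 1.0 - math.sqrt(1.0 - af)


def phenotypic_frequency(cgf):
    """Inverse formula: af = 1 - (1 - Cgf)^2, from a cumulative gene frequency."""
    return 1.0 - (1.0 - cgf) ** 2


def supertype_coverage(antigen_frequencies):
    """Sum per-allele gene frequencies, then convert back to a phenotypic frequency."""
    cgf = sum(gene_frequency(af) for af in antigen_frequencies)
    if cgf > 1.0:
        raise ValueError("cumulative gene frequency cannot exceed 1")
    return phenotypic_frequency(cgf)


def combine_loci(a_coverage, b_coverage):
    """total = A + B*(1 - A), assuming no linkage disequilibrium."""
    return a_coverage + b_coverage * (1.0 - a_coverage)


# Hypothetical antigen frequencies for two alleles at one locus, then a second locus.
a = supertype_coverage([0.36, 0.19])
print(round(a, 4), round(combine_loci(a, 0.40), 4))
```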

Immunogenicity studies in humans (e.g., Bertoni et al., J. Clin. Invest. 100:503, 1997; Doolan et al.,
Immunity 7:97, 1997; and Threlkeld et al., J. Immunol. 159:1648, 1997) have shown that highly cross-reactive
binding peptides are almost always recognized as epitopes. The use of highly cross-reactive binding peptides is an
important selection criterion in identifying candidate epitopes for inclusion in a vaccine that is immunogenic in a
diverse population.

With a sufficient number of epitopes (as disclosed herein and from the art), an average population
coverage is predicted to be greater than 95% in each of five major ethnic populations. The game theory Monte
Carlo simulation analysis, which is known in the art (see e.g., Osborne, M.J. and Rubinstein, A. "A course in game
theory" MIT Press, 1994), can be used to estimate what percentage of the individuals in a population comprised of
the Caucasian, North American Black, Japanese, Chinese, and Hispanic ethnic groups would recognize the vaccine
epitopes described herein. A preferred percentage is 90%. A more preferred percentage is 95%.

Recombinant NIH3T3 cells stably expressing 101P3A11 (3T3/101P3A11-pSRα) were derived and were
grown in SCID mice (Figure 20). Mice were injected subcutaneously with 3T3/101P3A11-pSRα cells. Tumors
were allowed to grow, mice were then sacrificed and tumors harvested. RNA was isolated from LAPC-4AD and
LAPC-4AI xenografts, 3T3/neo-pSRα and 3T3/101P3A11-pSRα grown in culture, as well as six different tumors
derived from 3T3/101P3A11-pSRα (Tumor #1-3). Northern blots with 10 µg of total RNA/lane were probed with
101P3A11 sequence. Results show that 101P3A11 was expressed in all 3T3-101P3A11 tumors that grew following
injection into mice.

Example 14: CTL Recognition Of Endogenously Processed Antigens After Priming 

This example confirms that CTL induced by native or analoged peptide epitopes identified and selected as
described herein recognize endogenously synthesized, i.e., native antigens.

Effector cells isolated from transgenic mice that are immunized with peptide epitopes, for example HLA-A2
supermotif-bearing epitopes, are re-stimulated in vitro using peptide-coated stimulator cells. Six days later,
effector cells are assayed for cytotoxicity and the cell lines that contain peptide-specific cytotoxic activity are
further re-stimulated. An additional six days later, these cell lines are tested for cytotoxic activity on 51Cr labeled
Jurkat-A2.1/Kb target cells in the absence or presence of peptide, and also tested on 51Cr labeled target cells bearing
the endogenously synthesized antigen, i.e. cells that are stably transfected with 101P3A11 expression vectors.

The results demonstrate that CTL lines obtained from animals primed with peptide epitope recognize
endogenously synthesized 101P3A11 antigen. The choice of transgenic mouse model to be used for such an
analysis depends upon the epitope(s) that are being evaluated. In addition to HLA-A*0201/Kb transgenic mice,
several other transgenic mouse models including mice with human A11, which may also be used to evaluate A3
epitopes, and B7 alleles have been characterized and others (e.g., transgenic mice for HLA-A1 and A24) are being
developed. HLA-DR1 and HLA-DR3 mouse models have also been developed, which may be used to evaluate
HTL epitopes.

Example 15: Activity Of CTL-HTL Conjugated Epitopes In Transgenic Mice

This example illustrates the induction of CTLs and HTLs in transgenic mice, by use of 101P3A11-derived
CTL and HTL peptide vaccine compositions. The vaccine composition used herein comprises peptides to be
administered to a patient with a 101P3A11-expressing tumor. The peptide composition can comprise multiple CTL
and/or HTL epitopes. The epitopes are identified using methodology as described herein. This example also
illustrates that enhanced immunogenicity can be achieved by inclusion of one or more HTL epitopes in a CTL
vaccine composition; such a peptide composition can comprise an HTL epitope conjugated to a CTL epitope. The
CTL epitope can be one that binds to multiple HLA family members at an affinity of 500 nM or less, or analogs of
that epitope. The peptides may be lipidated, if desired.

Immunization procedures: Immunization of transgenic mice is performed as described (Alexander et al.,
J. Immunol. 159:4753-4761, 1997). For example, A2/Kb mice, which are transgenic for the human HLA A2.1 allele
and are used to confirm the immunogenicity of HLA-A*0201 motif- or HLA-A2 supermotif-bearing epitopes,
are primed subcutaneously (base of the tail) with 0.1 ml of peptide in Incomplete Freund's Adjuvant, or if the
peptide composition is a lipidated CTL/HTL conjugate, in DMSO/saline, or if the peptide composition is a
polypeptide, in PBS or Incomplete Freund's Adjuvant. Seven days after priming, splenocytes obtained from these
animals are restimulated with syngeneic irradiated LPS-activated lymphoblasts coated with peptide.

Cell lines: Target cells for peptide-specific cytotoxicity assays are Jurkat cells transfected with the HLA-A2.1/Kb
chimeric gene (e.g., Vitiello et al., J. Exp. Med. 173:1007, 1991).

In vitro CTL activation: One week after priming, spleen cells (30 x 10^6 cells/flask) are co-cultured at 37°C
with syngeneic, irradiated (3000 rads), peptide coated lymphoblasts (10 x 10^6 cells/flask) in 10 ml of culture
medium/T25 flask. After six days, effector cells are harvested and assayed for cytotoxic activity.

Assay for cytotoxic activity: Target cells (1.0 to 1.5 x 10^6) are incubated at 37°C in the presence of 200 µl
of 51Cr. After 60 minutes, cells are washed three times and resuspended in R10 medium. Peptide is added where
required at a concentration of 1 µg/ml. For the assay, 10^4 51Cr-labeled target cells are added to different
concentrations of effector cells (final volume of 200 µl) in U-bottom 96-well plates. After a six hour incubation
period at 37°C, a 0.1 ml aliquot of supernatant is removed from each well and radioactivity is determined in a
Micromedic automatic gamma counter. The percent specific lysis is determined by the formula: percent specific
release = 100 x (experimental release - spontaneous release)/(maximum release - spontaneous release). To facilitate
comparison between separate CTL assays run under the same conditions, % 51Cr release data is expressed as lytic
units/10^6 cells. One lytic unit is arbitrarily defined as the number of effector cells required to achieve 30% lysis of
10,000 target cells in a six hour 51Cr release assay. To obtain specific lytic units/10^6, the lytic units/10^6 obtained in
the absence of peptide is subtracted from the lytic units/10^6 obtained in the presence of peptide. For example, if
30% 51Cr release is obtained at the effector (E): target (T) ratio of 50:1 (i.e., 5 x 10^5 effector cells for 10,000 targets)
in the absence of peptide and 5:1 (i.e., 5 x 10^4 effector cells for 10,000 targets) in the presence of peptide, the specific
lytic units would be: [(1/50,000)-(1/500,000)] x 10^6 = 18 LU.
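
The lytic unit arithmetic in the worked example above can be reproduced with the following sketch. The function names are introduced here for illustration; the inputs are the effector cell numbers required to reach 30% lysis of 10,000 targets.

```python
def lytic_units_per_million(effectors_for_30pct_lysis):
    """Lytic units per 10^6 effector cells: one LU is the number of effectors
    needed for 30% lysis of 10,000 targets."""
    return 1e6 / effectors_for_30pct_lysis


def specific_lytic_units(effectors_with_peptide, effectors_without_peptide):
    """LU/10^6 with peptide minus LU/10^6 without peptide."""
    return (lytic_units_per_million(effectors_with_peptide)
            - lytic_units_per_million(effectors_without_peptide))


# E:T of 5:1 with peptide (5 x 10^4 effectors) and 50:1 without (5 x 10^5 effectors).
print(specific_lytic_units(5e4, 5e5))  # 18.0
```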

The results are analyzed to assess the magnitude of the CTL responses of animals injected with the
immunogenic CTL/HTL conjugate vaccine preparation and are compared to the magnitude of the CTL response
achieved using, for example, CTL epitopes as outlined above in the Example entitled "Confirmation of
Immunogenicity". Analyses similar to this may be performed to confirm the immunogenicity of peptide conjugates
containing multiple CTL epitopes and/or multiple HTL epitopes. In accordance with these procedures, it is found
that a CTL response is induced, and concomitantly that an HTL response is induced upon administration of such
compositions.

Example 16: Selection of CTL and HTL epitopes for inclusion in a 101P3A11-specific vaccine.

This example illustrates a procedure for selecting peptide epitopes for vaccine compositions of the
invention. The peptides in the composition can be in the form of a nucleic acid sequence, either single or one or
more sequences (i.e., minigene) that encodes peptide(s), or can be single and/or polyepitopic peptides.

The following principles are utilized when selecting a plurality of epitopes for inclusion in a vaccine 
composition. Each of the following principles is balanced in order to make the selection. 

Epitopes are selected which, upon administration, mimic immune responses that are correlated with
101P3A11 clearance. The number of epitopes used depends on observations of patients who spontaneously clear
101P3A11. For example, if it has been observed that patients who spontaneously clear 101P3A11 generate an
immune response to at least three (3) epitopes from 101P3A11 antigen, then three or four (3-4) epitopes should be
included for HLA class I. A similar rationale is used to determine HLA class II epitopes.

Epitopes are often selected that have a binding affinity of an IC50 of 500 nM or less for an HLA class I
molecule, or for class II, an IC50 of 1000 nM or less; or HLA Class I peptides with high binding scores from the
BIMAS web site, at URL bimas.dcrt.nih.gov/.

In order to achieve broad coverage of the vaccine throughout a diverse population, sufficient supermotif
bearing peptides, or a sufficient array of allele-specific motif bearing peptides, are selected to give broad population
coverage. In one embodiment, epitopes are selected to provide at least 80% population coverage. A Monte Carlo
analysis, a statistical evaluation known in the art, can be employed to assess breadth, or redundancy, of population
coverage.

When creating polyepitopic compositions, or a minigene that encodes same, it is typically desirable to
generate the smallest peptide possible that encompasses the epitopes of interest. The principles employed are
similar, if not the same, as those employed when selecting a peptide comprising nested epitopes. For example, a
protein sequence for the vaccine composition is selected because it has maximal number of epitopes contained
within the sequence, i.e., it has a high concentration of epitopes. Epitopes may be nested or overlapping (i.e., frame
shifted relative to one another). For example, with overlapping epitopes, two 9-mer epitopes and one 10-mer
epitope can be present in a 10 amino acid peptide. Each epitope can be exposed and bound by an HLA molecule
upon administration of such a peptide. A multi-epitopic peptide can be generated synthetically, recombinantly, or
via cleavage from the native source. Alternatively, an analog can be made of this native sequence, whereby one or
more of the epitopes comprise substitutions that alter the cross-reactivity and/or binding affinity properties of the
polyepitopic peptide. Such a vaccine composition is administered for therapeutic or prophylactic purposes. This
embodiment provides for the possibility that an as yet undiscovered aspect of immune system processing will apply
to the native nested sequence and thereby facilitate the production of therapeutic or prophylactic immune response-
inducing vaccine compositions. Additionally such an embodiment provides for the possibility of motif-bearing
epitopes for an HLA makeup that is presently unknown. Furthermore, this embodiment (absent the creating of any
analogs) directs the immune response to multiple peptide sequences that are actually present in 101P3A11, thus
avoiding the need to evaluate any junctional epitopes. Lastly, the embodiment provides an economy of scale when
producing nucleic acid vaccine compositions. Related to this embodiment, computer programs can be derived in
accordance with principles in the art, which identify in a target sequence, the greatest number of epitopes per
sequence length.
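
One way such a program might identify the region with the greatest number of epitopes per sequence length is a simple window scan, sketched below. This is an illustrative assumption and not a program disclosed herein; epitope positions are given as hypothetical (start, end) pairs with the end exclusive, and the window is not clipped to the protein length.

```python
def densest_window(epitopes, window_len):
    """Return (window_start, count) for the window of window_len residues that
    fully contains the most epitopes. Only windows beginning at an epitope start
    need checking, since any window can be slid right to its first epitope."""
    best_start, best_count = 0, 0
    for w in sorted({start for start, _ in epitopes}):
        count = sum(1 for s, e in epitopes if s >= w and e <= w + window_len)
        if count > best_count:
            best_start, best_count = w, count
    return best_start, best_count


# Two 9-mers and one 10-mer nested in a 10-residue stretch, plus a distant 9-mer.
print(densest_window([(0, 9), (1, 10), (0, 10), (30, 39)], 10))  # (0, 3)
```
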
A vaccine composition comprised of selected peptides, when administered, is safe, efficacious, and elicits
an immune response similar in magnitude to an immune response that controls or clears cells that bear or 
overexpress 101P3A11. 


Example 17: Construction of "Minigene" Multi-Epitope DNA Plasmids

This example discusses the construction of a minigene expression plasmid. Minigene plasmids may, of 
course, contain various configurations of B cell, CTL and/or HTL epitopes or epitope analogs as described herein. 

A minigene expression plasmid typically includes multiple CTL and HTL peptide epitopes. In the present
example, HLA-A2, -A3, -B7 supermotif-bearing peptide epitopes and HLA-A1 and -A24 motif-bearing peptide
epitopes are used in conjunction with DR supermotif-bearing epitopes and/or DR3 epitopes. HLA class I
supermotif or motif-bearing peptide epitopes derived from 101P3A11 are selected such that multiple supermotifs/motifs
are represented to ensure broad population coverage. Similarly, HLA class II epitopes are selected from 101P3A11
to provide broad population coverage, i.e. both HLA DR-1-4-7 supermotif-bearing epitopes and HLA DR-3 motif-
bearing epitopes are selected for inclusion in the minigene construct. The selected CTL and HTL epitopes are then
incorporated into a minigene for expression in an expression vector.

Such a construct may additionally include sequences that direct the HTL epitopes to the endoplasmic
reticulum. For example, the Ii protein may be fused to one or more HTL epitopes as described in the art, wherein
the CLIP sequence of the Ii protein is removed and replaced with an HLA class II epitope sequence so that the HLA
class II epitope is directed to the endoplasmic reticulum, where the epitope binds to an HLA class II molecule.

This example illustrates the methods to be used for construction of a minigene-bearing expression plasmid.
Other expression vectors that may be used for minigene compositions are available and known to those of skill in
the art.

The minigene DNA plasmid of this example contains a consensus Kozak sequence and a consensus murine
kappa Ig-light chain signal sequence followed by CTL and/or HTL epitopes selected in accordance with principles
disclosed herein. The sequence encodes an open reading frame fused to the Myc and His antibody epitope tag
coded for by the pcDNA 3.1 Myc-His vector.

Overlapping oligonucleotides that can, for example, average about 70 nucleotides in length with 15 
nucleotide overlaps, are synthesized and HPLC-purified. The oligonucleotides encode the selected peptide epitopes 
as well as appropriate linker nucleotides, Kozak sequence, and signal sequence. The final multiepitope minigene is 
assembled by extending the overlapping oligonucleotides in three sets of reactions using PCR. A Perkin/Elmer 
9600 PCR machine is used and a total of 30 cycles are performed using the following conditions: 95°C for 15 sec, 
annealing temperature (5° below the lowest calculated Tm of each primer pair) for 30 sec, and 72°C for 1 min.
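
The oligonucleotide layout and the annealing rule described above can be illustrated with a minimal Python sketch. It assumes fixed-length tiling with alternating strands, which is one plausible reading of "overlapping oligonucleotides" and not a design stated in the text; the function names and the Tm values passed in are hypothetical, and Tm calculation itself is left to whatever method the user prefers.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def tile_oligos(seq, oligo_len=70, overlap=15):
    """Split seq into oligos of about oligo_len nt whose neighbours share
    `overlap` nt. Even-numbered oligos are top strand; odd-numbered oligos
    are given as the reverse complement so each pair can anneal and extend."""
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be positive and shorter than the oligo")
    step = oligo_len - overlap
    oligos = []
    start = 0
    while True:
        fragment = seq[start:start + oligo_len]
        if len(oligos) % 2 == 1:
            fragment = reverse_complement(fragment)
        oligos.append(fragment)
        if start + oligo_len >= len(seq):
            break
        start += step
    return oligos


def annealing_temperature(primer_tms, offset=5.0):
    """Annealing temperature: `offset` degrees below the lowest Tm of the pair."""
    return min(primer_tms) - offset
```

With these defaults, a 235-nucleotide insert is covered by four 70-mers, and a primer pair with calculated Tm values of 62.0 and 58.5 would be annealed at 53.5.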

For example, a minigene is prepared as follows. For a first PCR reaction, 5 µg of each of two 
oligonucleotides are annealed and extended: In an example using eight oligonucleotides, i.e., four pairs of primers, 
oligonucleotides 1+2, 3+4, 5+6, and 7+8 are combined in 100 µl reactions containing Pfu polymerase buffer (1x = 
10 mM KCl, 10 mM (NH4)2SO4, 20 mM Tris-chloride, pH 8.75, 2 mM MgSO4, 0.1% Triton X-100, 100 ng/ml 
BSA), 0.25 mM each dNTP, and 2.5 U of Pfu polymerase. The full-length dimer products are gel-purified, and two 
reactions containing the product of 1+2 and 3+4, and the product of 5+6 and 7+8 are mixed, annealed, and extended 
for 10 cycles. Half of the two reactions are then mixed, and 5 cycles of annealing and extension carried out before 
flanking primers are added to amplify the full length product. The full-length product is gel-purified and cloned 
into pCR-blunt (Invitrogen) and individual clones are screened by sequencing.
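
The three sets of reactions can be pictured as a pairwise merge: eight fragments become four dimers, then two tetramers, then the full-length product. The following Python sketch models this in silico, assuming every fragment is written as top-strand sequence and that adjacent fragments share an exact 15-nucleotide overlap. The function names are hypothetical, and the model ignores strand orientation, cycling conditions, and purification.

```python
def join_by_overlap(left, right, overlap=15):
    """Fuse two top-strand fragments that share `overlap` nt at their junction."""
    if left[-overlap:] != right[:overlap]:
        raise ValueError("fragments do not share the expected overlap")
    return left + right[overlap:]


def assemble_pairwise(fragments, overlap=15):
    """Merge neighbouring fragments round by round.

    Returns the final product and the number of rounds needed
    (three rounds for eight starting fragments)."""
    current = list(fragments)
    rounds = 0
    while len(current) > 1:
        merged = []
        for i in range(0, len(current), 2):
            if i + 1 < len(current):
                merged.append(join_by_overlap(current[i], current[i + 1], overlap))
            else:
                merged.append(current[i])  # an unpaired fragment carries over
        current = merged
        rounds += 1
    return current[0], rounds
```

Calling `assemble_pairwise` on eight 70-mers cut from a 455-nucleotide sequence at 55-nucleotide steps returns the original sequence after three rounds.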

Example 18: The Plasmid Construct and the Degree to Which It Induces Immunogenicity.

The degree to which a plasmid construct, for example a plasmid constructed in accordance with the 
previous Example, is able to induce immunogenicity is confirmed in vitro by determining epitope presentation by 
APC following transduction or transfection of the APC with an epitope-expressing nucleic acid construct. Such a 
study determines "antigenicity" and allows the use of human APC. The assay determines the ability of the epitope 
to be presented by the APC in a context that is recognized by a T cell by quantifying the density of epitope-HLA 
class I complexes on the cell surface. Quantitation can be performed by directly measuring the amount of peptide 
eluted from the APC (see, e.g., Sijts et al., J. Immunol. 156:683-692, 1996; Demotz et al., Nature 342:682-684, 
1989); or the number of peptide-HLA class I complexes can be estimated by measuring the amount of lysis or 
lymphokine release induced by diseased or transfected target cells, and then determining the concentration of 
peptide necessary to obtain equivalent levels of lysis or lymphokine release (see, e.g., Kageyama et al., J. Immunol. 
154:567-576, 1995).

Alternatively, immunogenicity is confirmed through in vivo injections into mice and subsequent in vitro 
assessment of CTL and HTL activity, which are analyzed using cytotoxicity and proliferation assays, respectively, 
as detailed, e.g., in Alexander et al., Immunity 1:751-761, 1994.

For example, to confirm the capacity of a DNA minigene construct containing at least one HLA-A2 
supermotif peptide to induce CTLs in vivo, HLA-A2.1/Kb transgenic mice, for example, are immunized 
intramuscularly with 100 µg of naked cDNA. As a means of comparing the level of CTLs induced by cDNA 
immunization, a control group of animals is also immunized with an actual peptide composition that comprises 
multiple epitopes synthesized as a single polypeptide as they would be encoded by the minigene.

Splenocytes from immunized animals are stimulated twice with each of the respective compositions 
(peptide epitopes encoded in the minigene or the polyepitopic peptide), then assayed for peptide-specific cytotoxic 
activity in a 51Cr release assay. The results indicate the magnitude of the CTL response directed against the 
A2-restricted epitope, thus indicating the in vivo immunogenicity of the minigene vaccine and polyepitopic vaccine.

It is, therefore, found that the minigene elicits immune responses directed toward the HLA-A2 supermotif 
peptide epitopes as does the polyepitopic peptide vaccine. A similar analysis is also performed using other 
HLA-A3 and HLA-B7 transgenic mouse models to assess CTL induction by HLA-A3 and HLA-B7 motif or supermotif 
epitopes, whereby it is also found that the minigene elicits appropriate immune responses directed toward the 
provided epitopes.

To confirm the capacity of a class II epitope-encoding minigene to induce HTLs in vivo, DR transgenic 
mice, or for those epitopes that cross react with the appropriate mouse MHC molecule, I-Ab-restricted mice, for 
example, are immunized intramuscularly with 100 µg of plasmid DNA. As a means of comparing the level of 
HTLs induced by DNA immunization, a group of control animals is also immunized with an actual peptide 
composition emulsified in complete Freund's adjuvant. CD4+ T cells, i.e., HTLs, are purified from splenocytes of 
immunized animals and stimulated with each of the respective compositions (peptides encoded in the minigene). 
The HTL response is measured using a 3H-thymidine incorporation proliferation assay (see, e.g., Alexander et al., 
Immunity 1:751-761, 1994). The results indicate the magnitude of the HTL response, thus demonstrating the in 
vivo immunogenicity of the minigene.

DNA minigenes, constructed as described in the previous Example, can also be confirmed as a vaccine in 
combination with a boosting agent using a prime boost protocol. The boosting agent can consist of recombinant 
protein (e.g., Barnett et al., Aids Res. and Human Retroviruses 14, Supplement 3:S299-S309, 1998) or recombinant 
vaccinia, for example, expressing a minigene or DNA encoding the complete protein of interest (see, e.g., Hanke et 
al., Vaccine 16:439-445, 1998; Sedegah et al., Proc. Natl. Acad. Sci USA 95:7648-53, 1998; Hanke and 
McMichael, Immunol. Letters 66:177-181, 1999; and Robinson et al., Nature Med. 5:526-34, 1999).

For example, the efficacy of the DNA minigene used in a prime boost protocol is initially evaluated in 
transgenic mice. In this example, A2.1/Kb transgenic mice are immunized IM with 100 µg of a DNA minigene 
encoding the immunogenic peptides including at least one HLA-A2 supermotif-bearing peptide. After an 
incubation period (ranging from 3-9 weeks), the mice are boosted IP with 10^7 pfu/mouse of a recombinant vaccinia 
virus expressing the same sequence encoded by the DNA minigene. Control mice are immunized with 100 µg of 
DNA or recombinant vaccinia without the minigene sequence, or with DNA encoding the minigene, but without the 
vaccinia boost. After an additional incubation period of two weeks, splenocytes from the mice are immediately 
assayed for peptide-specific activity in an ELISPOT assay. Additionally, splenocytes are stimulated in vitro with 
the A2-restricted peptide epitopes encoded in the minigene and recombinant vaccinia, then assayed for 
peptide-specific activity in an alpha, beta and/or gamma IFN ELISA.

It is found that the minigene utilized in a prime-boost protocol elicits greater immune responses toward the 
HLA-A2 supermotif peptides than with DNA alone. Such an analysis can also be performed using HLA-A11 or 
HLA-B7 transgenic mouse models to assess CTL induction by HLA-A3 or HLA-B7 motif or supermotif epitopes. 
The use of prime boost protocols in humans is described below in the Example entitled "Induction of CTL 
Responses Using a Prime Boost Protocol."

Example 19: Peptide Composition for Prophylactic Uses 

Vaccine compositions of the present invention can be used to prevent 101P3A11 expression in persons 
who are at risk for tumors that bear this antigen. For example, a polyepitopic peptide epitope composition (or a 
nucleic acid comprising the same) containing multiple CTL and HTL epitopes such as those selected in the above 
Examples, which are also selected to target greater than 80% of the population, is administered to individuals at risk 
for a 101P3A11-associated tumor.

For example, a peptide-based composition is provided as a single polypeptide that encompasses multiple 
epitopes. The vaccine is typically administered in a physiological solution that comprises an adjuvant, such as 
Incomplete Freund's Adjuvant. The dose of peptide for the initial immunization is from about 1 to about 50,000 
µg, generally 100-5,000 µg, for a 70 kg patient. The initial administration of vaccine is followed by booster 
dosages at 4 weeks followed by evaluation of the magnitude of the immune response in the patient, by techniques 
that determine the presence of epitope-specific CTL populations in a PBMC sample. Additional booster doses are 
administered as required. The composition is found to be both safe and efficacious as a prophylaxis against 
101P3A11-associated disease.

Alternatively, a composition typically comprising transfecting agents is used for the administration of a 
nucleic acid-based vaccine in accordance with methodologies known in the art and disclosed herein.


Example 20: Polyepitopic Vaccine Compositions Derived from Native 101P3A11 Sequences 

A native 101P3A11 polyprotein sequence is analyzed, preferably using computer algorithms defined for 
each class I and/or class II supermotif or motif, to identify "relatively short" regions of the polyprotein that 
comprise multiple epitopes. The "relatively short" regions are preferably less in length than an entire native 
antigen. This relatively short sequence that contains multiple distinct or overlapping, "nested" epitopes is selected; 
it can be used to generate a minigene construct. The construct is engineered to express the peptide, which 
corresponds to the native protein sequence. The "relatively short" peptide is generally less than 250 amino acids in 
length, often less than 100 amino acids in length, preferably less than 75 amino acids in length, and more preferably 
less than 50 amino acids in length. The protein sequence of the vaccine composition is selected because it has a 
maximal number of epitopes contained within the sequence, i.e., it has a high concentration of epitopes. As noted 
herein, epitope motifs may be nested or overlapping (i.e., frame shifted relative to one another). For example, with 
overlapping epitopes, two 9-mer epitopes and one 10-mer epitope can be present in a 10 amino acid peptide. Such a 
vaccine composition is administered for therapeutic or prophylactic purposes.

The vaccine composition will include, for example, multiple CTL epitopes from 101P3A11 antigen and at 
least one HTL epitope. This polyepitopic native sequence is administered either as a peptide or as a nucleic acid 
sequence which encodes the peptide. Alternatively, an analog can be made of this native sequence, whereby one or 
more of the epitopes comprise substitutions that alter the cross-reactivity and/or binding affinity properties of the 
polyepitopic peptide.

The embodiment of this example provides for the possibility that an as yet undiscovered aspect of immune 
system processing will apply to the native nested sequence and thereby facilitate the production of therapeutic or 
prophylactic immune response-inducing vaccine compositions. Additionally, such an embodiment provides for the 
possibility of motif-bearing epitopes for an HLA makeup that is presently unknown. Furthermore, this embodiment 
(excluding an analoged embodiment) directs the immune response to multiple peptide sequences that are actually 
present in native 101P3A11, thus avoiding the need to evaluate any junctional epitopes. Lastly, the embodiment 
provides an economy of scale when producing peptide or nucleic acid vaccine compositions.

Related to this embodiment, computer programs are available in the art which can be used to identify in a 
target sequence, the greatest number of epitopes per sequence length. 
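
As an illustration of what such a program does, the following Python sketch scans a protein with a fixed-length window and reports the window that fully contains the most predicted epitopes. It is a minimal sketch and not any particular program known in the art: the function name, the (start, length) epitope representation, and the coordinates in the usage note are hypothetical.

```python
def densest_window(epitopes, protein_length, window):
    """Find the window start that fully contains the most epitopes.

    epitopes: list of (start, length) pairs, 0-based start positions.
    Returns (best_start, epitope_count); ties go to the earliest window."""
    best_start, best_count = 0, -1
    last_start = max(1, protein_length - window + 1)
    for w in range(last_start):
        count = sum(
            1 for start, length in epitopes
            if start >= w and start + length <= w + window
        )
        if count > best_count:
            best_start, best_count = w, count
    return best_start, best_count
```

For instance, with two nested 9-mers at positions 20 and 21 and a 10-mer at position 20, a 10-residue window starting at position 20 contains all three epitopes, which mirrors the two 9-mer plus one 10-mer case described in this Example.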

Example 21: Polyepitopic Vaccine Compositions From Multiple Antigens

The 101P3A11 peptide epitopes of the present invention are used in conjunction with epitopes from other 
target tumor-associated antigens, to create a vaccine composition that is useful for the prevention or treatment of 
cancer that expresses 101P3A11 and such other antigens. For example, a vaccine composition can be provided as a 
single polypeptide that incorporates multiple epitopes from 101P3A11 as well as tumor-associated antigens that are 
often expressed with a target cancer associated with 101P3A11 expression, or can be administered as a composition 
comprising a cocktail of one or more discrete epitopes. Alternatively, the vaccine can be administered as a 
minigene construct or as dendritic cells which have been loaded with the peptide epitopes in vitro.

Example 22: Use of peptides to evaluate an immune response 

Peptides of the invention may be used to analyze an immune response for the presence of specific 
antibodies, CTL or HTL directed to 101P3A11. Such an analysis can be performed in a manner described by Ogg 
et al., Science 279:2103-2106, 1998. In this Example, peptides in accordance with the invention are used as a 
reagent for diagnostic or prognostic purposes, not as an immunogen.

In this example, highly sensitive human leukocyte antigen tetrameric complexes ("tetramers") are used for 
a cross-sectional analysis of, for example, 101P3A11 HLA-A*0201-specific CTL frequencies from 
HLA-A*0201-positive individuals at different stages of disease or following immunization comprising an 101P3A11 peptide 
containing an A*0201 motif. Tetrameric complexes are synthesized as described (Musey et al., N. Engl. J. Med. 
337:1267, 1997). Briefly, purified HLA heavy chain (A*0201 in this example) and β2-microglobulin are 
synthesized by means of a prokaryotic expression system. The heavy chain is modified by deletion of the 
transmembrane-cytosolic tail and COOH-terminal addition of a sequence containing a BirA enzymatic biotinylation 
site. The heavy chain, β2-microglobulin, and peptide are refolded by dilution. The 45-kD refolded product is 
isolated by fast protein liquid chromatography and then biotinylated by BirA in the presence of biotin (Sigma, St. 
Louis, Missouri), adenosine 5' triphosphate and magnesium. Streptavidin-phycoerythrin conjugate is added in a 1:4 
molar ratio, and the tetrameric product is concentrated to 1 mg/ml. The resulting product is referred to as 
tetramer-phycoerythrin.

For the analysis of patient blood samples, approximately one million PBMCs are centrifuged at 300g for 5 
minutes and resuspended in 50 µl of cold phosphate-buffered saline. Tri-color analysis is performed with the 
tetramer-phycoerythrin, along with anti-CD8-Tricolor, and anti-CD38. The PBMCs are incubated with tetramer 
and antibodies on ice for 30 to 60 min and then washed twice before formaldehyde fixation. Gates are applied to 
contain >99.98% of control samples. Controls for the tetramers include both A*0201-negative individuals and 
A*0201-positive non-diseased donors. The percentage of cells stained with the tetramer is then determined by flow 
cytometry. The results indicate the number of cells in the PBMC sample that contain epitope-restricted CTLs, 
thereby readily indicating the extent of immune response to the 101P3A11 epitope, and thus the status of exposure 
to 101P3A11, or exposure to a vaccine that elicits a protective or therapeutic response.

Example 23: Use of Peptide Epitopes to Evaluate Recall Responses

The peptide epitopes of the invention are used as reagents to evaluate T cell responses, such as acute or 
recall responses, in patients. Such an analysis may be performed on patients who have recovered from 
101P3A11-associated disease or who have been vaccinated with an 101P3A11 vaccine.

For example, the class I restricted CTL response of persons who have been vaccinated may be analyzed. 
The vaccine may be any 101P3A11 vaccine. PBMC are collected from vaccinated individuals and HLA typed. 
Appropriate peptide epitopes of the invention that, optimally, bear supermotifs to provide cross-reactivity with 
multiple HLA supertype family members, are then used for analysis of samples derived from individuals who bear 
that HLA type.

PBMC from vaccinated individuals are separated on Ficoll-Histopaque density gradients (Sigma Chemical 
Co., St. Louis, MO), washed three times in HBSS (GIBCO Laboratories), resuspended in RPMI-1640 (GIBCO 
Laboratories) supplemented with L-glutamine (2 mM), penicillin (50 U/ml), streptomycin (50 µg/ml), and Hepes 
(10 mM) containing 10% heat-inactivated human AB serum (complete RPMI) and plated using microculture 
formats. A synthetic peptide comprising an epitope of the invention is added at 10 µg/ml to each well and HBV 
core 128-140 epitope is added at 1 µg/ml to each well as a source of T cell help during the first week of stimulation.

In the microculture format, 4 x 10^5 PBMC are stimulated with peptide in 8 replicate cultures in 96-well 
round bottom plate in 100 µl/well of complete RPMI. On days 3 and 10, 100 µl of complete RPMI and 20 U/ml 
final concentration of rIL-2 are added to each well. On day 7 the cultures are transferred into a 96-well flat-bottom 
plate and restimulated with peptide, rIL-2 and 10^5 irradiated (3,000 rad) autologous feeder cells. The cultures are 
tested for cytotoxic activity on day 14. A positive CTL response requires two or more of the eight replicate cultures 
to display greater than 10% specific 51Cr release, based on comparison with non-diseased control subjects as 
previously described (Rehermann et al., Nature Med. 2:1104,1108, 1996; Rehermann et al., J. Clin. Invest. 
97:1655-1665, 1996; and Rehermann et al., J. Clin. Invest. 98:1432-1440, 1996).

Target cell lines are autologous and allogeneic EBV-transformed B-LCL that are either purchased from the 
American Society for Histocompatibility and Immunogenetics (ASHI, Boston, MA) or established from the pool of 
patients as described (Guilhot et al., J. Virol. 66:2670-2678, 1992).

Cytotoxicity assays are performed in the following manner. Target cells consist of either allogeneic 
HLA-matched or autologous EBV-transformed B lymphoblastoid cell lines that are incubated overnight with the synthetic 
peptide epitope of the invention at 10 µM, and labeled with 100 µCi of 51Cr (Amersham Corp., Arlington Heights, 
IL) for 1 hour after which they are washed four times with HBSS.

Cytolytic activity is determined in a standard 4-h, split well 51Cr release assay using U-bottomed 96 well 
plates containing 3,000 targets/well. Stimulated PBMC are tested at effector/target (E/T) ratios of 20-50:1 on day 
14. Percent cytotoxicity is determined from the formula: 100 x [(experimental release - spontaneous 
release)/(maximum release - spontaneous release)]. Maximum release is determined by lysis of targets by detergent 
(2% Triton X-100; Sigma Chemical Co., St. Louis, MO). Spontaneous release is <25% of maximum release for all 
experiments.
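
The percent cytotoxicity formula, the spontaneous-release check, and the positive-response criterion stated earlier in this Example (two or more of eight replicate cultures above 10% specific release) can be expressed as a short Python sketch. The function names and the counts used in the usage note are hypothetical; only the formula and the thresholds come from the text.

```python
def percent_cytotoxicity(experimental, spontaneous, maximum):
    """100 x (experimental - spontaneous) / (maximum - spontaneous)."""
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


def spontaneous_release_ok(spontaneous, maximum, limit=0.25):
    """True when spontaneous release is below 25% of maximum release."""
    return spontaneous < limit * maximum


def positive_ctl_response(replicate_percentages, threshold=10.0, min_positive=2):
    """True when at least `min_positive` replicates exceed `threshold` percent."""
    positives = sum(1 for value in replicate_percentages if value > threshold)
    return positives >= min_positive
```

For example, with counts of 1200 (experimental), 400 (spontaneous), and 2400 (maximum), the formula gives 40% specific release, and the spontaneous release of 400 is below the 25% limit of 600.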

The results of such an analysis indicate the extent to which HLA-restricted CTL populations have been 
stimulated by previous exposure to 101P3A11 or an 101P3A11 vaccine.

Similarly, Class II restricted HTL responses may also be analyzed. Purified PBMC are cultured in a 
96-well flat bottom plate at a density of 1.5 x 10^5 cells/well and are stimulated with 10 µg/ml synthetic peptide of the 
invention, whole 101P3A11 antigen, or PHA. Cells are routinely plated in replicates of 4-6 wells for each 
condition. After seven days of culture, the medium is removed and replaced with fresh medium containing 10 U/ml 
IL-2. Two days later, 1 µCi 3H-thymidine is added to each well and incubation is continued for an additional 18 
hours. Cellular DNA is then harvested on glass fiber mats and analyzed for 3H-thymidine incorporation. 
Antigen-specific T cell proliferation is calculated as the ratio of 3H-thymidine incorporation in the presence of antigen 
divided by the 3H-thymidine incorporation in the absence of antigen.
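
The proliferation ratio described above is commonly reported as a stimulation index. A minimal Python sketch follows; averaging the replicate wells before taking the ratio is an assumption of this sketch (the text specifies only the ratio), and the function name and counts are hypothetical.

```python
from statistics import mean


def stimulation_index(cpm_with_antigen, cpm_without_antigen):
    """Ratio of mean 3H-thymidine counts with antigen to mean counts without.

    Each argument is a list of counts per minute from replicate wells."""
    background = mean(cpm_without_antigen)
    if background <= 0:
        raise ValueError("background incorporation must be positive")
    return mean(cpm_with_antigen) / background
```

Four antigen wells averaging 3,000 cpm against four control wells averaging 500 cpm would give a ratio of 6.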

Example 24: Induction Of Specific CTL Response In Humans 

A human clinical trial for an immunogenic composition comprising CTL and HTL epitopes of the 
invention is set up as an IND Phase I, dose escalation study and carried out as a randomized, double-blind, 
placebo-controlled trial. Such a trial is designed, for example, as follows:

A total of about 27 individuals are enrolled and divided into 3 groups: 

Group I: 3 subjects are injected with placebo and 6 subjects are injected with 5 µg of peptide composition; 
Group II: 3 subjects are injected with placebo and 6 subjects are injected with 50 µg of peptide composition; 
Group III: 3 subjects are injected with placebo and 6 subjects are injected with 500 µg of peptide 
composition.

Four weeks after the first injection, all subjects receive a booster inoculation at the same dosage.

The endpoints measured in this study relate to the safety and tolerability of the peptide composition as well 
as its immunogenicity. Cellular immune responses to the peptide composition are an index of the intrinsic activity 
of this peptide composition, and can therefore be viewed as a measure of biological efficacy. The following 
summarize the clinical and laboratory data that relate to safety and efficacy endpoints.

Safety: The incidence of adverse events is monitored in the placebo and drug treatment group and 
assessed in terms of degree and reversibility.

Evaluation of Vaccine Efficacy: For evaluation of vaccine efficacy, subjects are bled before and after 
injection. Peripheral blood mononuclear cells are isolated from fresh heparinized blood by Ficoll-Hypaque density 
gradient centrifugation, aliquoted in freezing media and stored frozen. Samples are assayed for CTL and HTL 
activity.

The vaccine is found to be both safe and efficacious. 


Example 25: Phase II Trials In Patients Expressing 101P3A11 

Phase II trials are performed to study the effect of administering the CTL-HTL peptide compositions to 
patients having cancer that expresses 101P3A11. The main objectives of the trial are to determine an effective dose 
and regimen for inducing CTLs in cancer patients that express 101P3A11, to establish the safety of inducing a CTL 
and HTL response in these patients, and to see to what extent activation of CTLs improves the clinical picture of 
these patients, as manifested, e.g., by the reduction and/or shrinking of lesions. Such a study is designed, for 
example, as follows:

The studies are performed in multiple centers. The trial design is an open-label, uncontrolled, dose 
escalation protocol wherein the peptide composition is administered as a single dose followed six weeks later by a 
single booster shot of the same dose. The dosages are 50, 500 and 5,000 micrograms per injection. Drug- 
associated adverse effects (severity and reversibility) are recorded. 

There are three patient groupings. The first group is injected with 50 micrograms of the peptide 
composition and the second and third groups with 500 and 5,000 micrograms of peptide composition, respectively. 
The patients within each group range in age from 21-65 and represent diverse ethnic backgrounds. All of them 
have a tumor that expresses 101P3A11.

Clinical manifestations or antigen-specific T-cell responses are monitored to assess the effects of 
administering the peptide compositions. The vaccine composition is found to be both safe and efficacious in the 
treatment of 101P3A11-associated disease.

Example 26: Induction of CTL Responses Using a Prime Boost Protocol 

A prime boost protocol similar in its underlying principle to that used to confirm the efficacy of a DNA 
vaccine in transgenic mice, such as described above in the Example entitled "The Plasmid Construct and the Degree 
to Which It Induces Immunogenicity," can also be used for the administration of the vaccine to humans. Such a 
vaccine regimen can include an initial administration of, for example, naked DNA followed by a boost using 
recombinant virus encoding the vaccine, or recombinant protein/polypeptide or a peptide mixture administered in an 
adjuvant.

For example, the initial immunization may be performed using an expression vector, such as that 
constructed in the Example entitled "Construction of 'Minigene' Multi-Epitope DNA Plasmids" in the form of 
naked nucleic acid administered IM (or SC or ID) in the amounts of 0.5-5 mg at multiple sites. The nucleic acid 
(0.1 to 1000 µg) can also be administered using a gene gun. Following an incubation period of 3-4 weeks, a booster 
dose is then administered. The booster can be recombinant fowlpox virus administered at a dose of 5-10^7 to 5x10^9 
pfu. An alternative recombinant virus, such as an MVA, canarypox, adenovirus, or adeno-associated virus, can also 
be used for the booster, or the polyepitopic protein or a mixture of the peptides can be administered. For evaluation 
of vaccine efficacy, patient blood samples are obtained before immunization as well as at intervals following 
administration of the initial vaccine and booster doses of the vaccine. Peripheral blood mononuclear cells are 
isolated from fresh heparinized blood by Ficoll-Hypaque density gradient centrifugation, aliquoted in freezing 
media and stored frozen. Samples are assayed for CTL and HTL activity.

Analysis of the results indicates that a magnitude of response sufficient to achieve a therapeutic or 
protective immunity against 101P3A11 is generated.

Example 27: Administration of Vaccine Compositions Using Dendritic Cells (DC) 

Vaccines comprising peptide epitopes of the invention can be administered using APCs, or "professional" 
APCs such as DC. In this example, peptide-pulsed DC are administered to a patient to stimulate a CTL response in 
vivo. In this method, dendritic cells are isolated, expanded, and pulsed with a vaccine comprising peptide CTL and 
HTL epitopes of the invention. The dendritic cells are infused back into the patient to elicit CTL and HTL 
responses in vivo. The induced CTL and HTL then destroy or facilitate destruction, respectively, of the target cells 
that bear the 101P3A11 protein from which the epitopes in the vaccine are derived.

For example, a cocktail of epitope-comprising peptides is administered ex vivo to PBMC, or isolated DC 
therefrom. A pharmaceutical to facilitate harvesting of DC can be used, such as Progenipoietin™ (Monsanto, St. 
Louis, MO) or GM-CSF/IL-4. After pulsing the DC with peptides, and prior to reinfusion into patients, the DC are 
washed to remove unbound peptides.

As appreciated clinically, and readily determined by one of skill based on clinical outcomes, the number of 
DC reinfused into the patient can vary (see, e.g., Nature Med. 4:328, 1998; Nature Med. 2:52, 1996 and Prostate 
32:272, 1997). Although 2-50 x 10^6 DC per patient are typically administered, larger numbers of DC, such as 10^7 or 
10^8, can also be provided. Such cell populations typically contain between 50-90% DC.

In some embodiments, peptide-loaded PBMC are injected into patients without purification of the DC. For 
example, PBMC generated after treatment with an agent such as Progenipoietin™ are injected into patients without 
purification of the DC. The total number of PBMC that are administered often ranges from 10^8 to 10^10. Generally, 
the cell dose injected into each patient is based on the percentage of DC in the blood of that patient, as determined, for 
example, by immunofluorescence analysis with specific anti-DC antibodies. Thus, for example, if Progenipoietin™ 
mobilizes 2% DC in the peripheral blood of a given patient, and that patient is to receive 5 x 10^6 DC, then the 
patient will be injected with a total of 2.5 x 10^8 peptide-loaded PBMC. The percent DC mobilized by an agent such 
as Progenipoietin™ is typically estimated to be between 2-10%, but can vary as appreciated by one of skill in the 
art.
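
The dose arithmetic in the preceding paragraph is a simple division of the target DC number by the DC fraction in the PBMC preparation. A minimal Python sketch, with a hypothetical function name:

```python
def pbmc_dose(target_dc, dc_fraction):
    """Number of peptide-loaded PBMC to inject so that the injected cells
    contain `target_dc` dendritic cells, given the fraction of DC in PBMC."""
    if not 0 < dc_fraction <= 1:
        raise ValueError("dc_fraction must be in the interval (0, 1]")
    return target_dc / dc_fraction
```

With the figures from the text, `pbmc_dose(5e6, 0.02)` gives 2.5 x 10^8 PBMC; at the upper end of the stated 2-10% range, the same DC target would require 5 x 10^7 PBMC.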

Ex vivo activation of CTL/HTL responses 

Alternatively, ex vivo CTL or HTL responses to 101P3A11 antigens can be induced by incubating, in 
tissue culture, the patient's, or genetically compatible, CTL or HTL precursor cells together with a source of APC, 
such as DC, and immunogenic peptides. After an appropriate incubation time (typically about 7-28 days), in which 
the precursor cells are activated and expanded into effector cells, the cells are infused into the patient, where they 
will destroy (CTL) or facilitate destruction (HTL) of their specific target cells, i.e., tumor cells.


Example 28: An Alternative Method of Identifying and Confirming Motif-Bearing Peptides 


Another method of identifying and confirming motif-bearing peptides is to elute them from cells bearing 
defined MHC molecules. For example, EBV transformed B cell lines used for tissue typing have been extensively 
characterized to determine which HLA molecules they express. In certain cases these cells express only a single 
type of HLA molecule. These cells can be transfected with nucleic acids that express the antigen of interest, e.g., 
101P3A11. Peptides produced by endogenous antigen processing of peptides produced as a result of transfection 
will then bind to HLA molecules within the cell and be transported and displayed on the cell's surface. Peptides are 
then eluted from the HLA molecules by exposure to mild acid conditions and their amino acid sequence 
determined, e.g., by mass spectral analysis (e.g., Kubo et al., J. Immunol. 152:3913, 1994). Because the majority of 
peptides that bind a particular HLA molecule are motif-bearing, this is an alternative modality for obtaining the 
motif-bearing peptides correlated with the particular HLA molecule expressed on the cell.

Alternatively, cell lines that do not express endogenous HLA molecules can be transfected with an 
expression construct encoding a single HLA allele. These cells can then be used as described, i.e., they can then be 
transfected with nucleic acids that encode 101P3A11 to isolate peptides corresponding to 101P3A11 that have been 
presented on the cell surface. Peptides obtained from such an analysis will bear motif(s) that correspond to binding 
to the single HLA allele that is expressed in the cell.

As appreciated by one in the art, one can perform a similar analysis on a cell bearing more than one HLA 
allele and subsequently determine peptides specific for each HLA allele expressed. Moreover, one of skill would 
also recognize that means other than transfection, such as loading with a protein antigen, can be used to provide a 
source of antigen to the cell.

Example 29: Complementary Polynucleotides 

Sequences complementary to the 101P3A11-encoding sequences, or any parts thereof, are used to detect,
decrease, or inhibit expression of naturally occurring 101P3A11. Although use of oligonucleotides comprising
from about 15 to 30 base pairs is described, essentially the same procedure is used with smaller or with larger
sequence fragments. Appropriate oligonucleotides are designed using, e.g., OLIGO 4.06 software (National
Biosciences) and the coding sequence of 101P3A11. To inhibit transcription, a complementary oligonucleotide is
designed from the most unique 5' sequence and used to prevent promoter binding to the coding sequence. To
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the
101P3A11-encoding transcript.
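
By way of illustration only, the derivation of a complementary oligonucleotide from a chosen target window can be sketched as follows. The sketch is not the OLIGO 4.06 procedure; the function names are hypothetical, and the sequence shown is a made-up placeholder rather than the 101P3A11 coding sequence.

```python
# Hypothetical helpers; the placeholder sequence is NOT the 101P3A11 coding sequence.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def antisense_oligo(coding_seq: str, start: int, length: int = 20) -> str:
    """Return an oligo (written 5'->3') complementary to coding_seq[start:start+length]."""
    if not 15 <= length <= 30:
        raise ValueError("length is outside the 15-30 nucleotide range discussed in the text")
    target = coding_seq[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the sequence")
    return reverse_complement(target)


placeholder_cds = "ATGGCTAGCTTGACCGGTAACGTTCAGGATCC"  # arbitrary illustrative sequence
oligo = antisense_oligo(placeholder_cds, start=0, length=20)
```

Taking the reverse complement of the returned oligonucleotide recovers the targeted window, which is a convenient sanity check on any design of this kind.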

Example 30: Purification of Naturally-occurring or Recombinant 101P3A11 Using 101P3A11 Specific Antibodies

Naturally occurring or recombinant 101P3A11 is substantially purified by immunoaffinity chromatography
using antibodies specific for 101P3A11. An immunoaffinity column is constructed by covalently coupling
anti-101P3A11 antibody to an activated chromatographic resin, such as CNBr-activated SEPHAROSE (Amersham
Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the manufacturer's
instructions.

Media containing 101P3A11 are passed over the immunoaffinity column, and the column is washed under
conditions that allow the preferential absorbance of 101P3A11 (e.g., high ionic strength buffers in the presence of
detergent). The column is eluted under conditions that disrupt antibody/101P3A11 binding (e.g., a buffer of pH 2 to
pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and 101P3A11 is collected.


Example 31: Identification of Molecules Which Interact with 101P3A11

101P3A11, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter reagent. (See,
e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously arrayed in the wells of a multi-well
plate are incubated with the labeled 101P3A11, washed, and any wells with labeled 101P3A11 complex are
assayed. Data obtained using different concentrations of 101P3A11 are used to calculate values for the number,
affinity, and association of 101P3A11 with the candidate molecules.
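
One conventional way to extract such values from concentration-dependent binding data is a Scatchard regression, sketched below. The one-site binding model, the function name, and the synthetic data are assumptions made for illustration; the disclosure does not specify the calculation method.

```python
def scatchard_fit(free, bound):
    """Estimate (Kd, Bmax) under an ASSUMED one-site model B = Bmax*F/(Kd+F).

    Regresses bound/free on bound: slope = -1/Kd, intercept = Bmax/Kd.
    Kd reflects affinity; Bmax reflects the number of binding sites.
    """
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


# Synthetic, noise-free data generated with Kd = 2.0 and Bmax = 100.0 (arbitrary units).
free = [0.5, 1.0, 2.0, 4.0, 8.0]
bound = [100.0 * f / (2.0 + f) for f in free]
kd, bmax = scatchard_fit(free, bound)
```

With real counts, nonlinear fitting of the binding isotherm is generally more robust than the linearized form; the linear version is shown only because it makes the relationship between the plotted quantities and the fitted parameters explicit.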


Example 32: In Vivo Assay for 101P3A11 Tumor Growth Promotion 

The effect of the 101P3A11 protein on tumor cell growth can be confirmed in vivo by gene overexpression
in a variety of cancer cells, including prostate, kidney, colon and bladder. For example, SCID mice can be injected
SQ on each flank with 1 x 10^6 prostate, kidney, colon or bladder cancer cells (such as PC3, LNCaP, SCaBER,
UM-UC-3, HT1376, SK-CO, Caco, RT4, T24, Caki, A-498 and SW839 cells) containing tkNeo empty vector or
101P3A11.

At least two strategies can be used: 

(1) Constitutive 101P3A11 expression under regulation of a promoter such as a constitutive promoter
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 1989),
adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus,
hepatitis-B virus and Simian Virus 40 (SV40), or from heterologous mammalian promoters, e.g., the actin promoter
or an immunoglobulin promoter, provided such promoters are compatible with the host cell systems.

(2) Regulated expression under control of an inducible vector system, such as ecdysone, tet, etc., can be
used provided such promoters are compatible with the host cell systems.

Tumor volume is then monitored at the appearance of palpable tumors and is followed over time to validate that
101P3A11-expressing cells grow at a faster rate and that tumors produced by 101P3A11-expressing cells demonstrate
characteristics of altered aggressiveness (e.g., enhanced metastasis, vascularization, reduced responsiveness to
chemotherapeutic drugs). Figure 21 compares subcutaneous growth of control 3T3-neo and 3T3-101P3A11 cells. One
million cells stably expressing neo or 101P3A11 were injected subcutaneously in SCID mice along with matrigel.
Tumor volume was evaluated by caliper measurements. This experiment demonstrates that expression of 101P3A11 in
NIH 3T3 cells induces tumor formation in 6/6 mice. Figure 42 demonstrates that 101P3A11 induces orthotopic growth
of tumors. The results indicate that expression of 101P3A11 is sufficient to induce tumor formation in vivo.
Additionally, mice were implanted with the same cells orthotopically in the prostate to determine if 101P3A11 has
an effect on local growth in the prostate or on the ability of the cells to metastasize, specifically to lungs or
lymph nodes. In an analogous manner, cells can be implanted orthotopically in the bladder, colon or kidney.
(Saffran, D., et al., PNAS 10:1073-1078; Fu, X., et al., Int. J. Cancer, 1991. 49: p. 938-939; Chang, S., et al.,
Anticancer Res., 1997. 17: p. 3239-3242; Peralta, E. A., et al., J. Urol., 1999. 162: p. 1806-1811).
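
As a non-limiting illustration of how caliper measurements can be converted to tumor volumes and compared between groups, the following sketch uses the common ellipsoid approximation V = L x W^2 / 2. That formula is an assumption, since the disclosure does not state which formula was applied, and the measurements shown are invented placeholders rather than data from Figure 21.

```python
from statistics import mean


def tumor_volume_mm3(length_mm: float, width_mm: float) -> float:
    """Ellipsoid approximation V = L * W^2 / 2 (ASSUMED; not stated in the source)."""
    return length_mm * width_mm ** 2 / 2.0


def mean_group_volume(measurements) -> float:
    """Mean tumor volume for a group given (length_mm, width_mm) caliper pairs."""
    return mean(tumor_volume_mm3(length, width) for length, width in measurements)


# Invented placeholder measurements for one time point (mm).
control_group = [(4.0, 3.0), (5.0, 4.0)]
expressing_group = [(10.0, 6.0), (12.0, 8.0)]

control_mean = mean_group_volume(control_group)
expressing_mean = mean_group_volume(expressing_group)
```

Repeating the calculation at each measurement day yields the growth curves from which relative growth rates are compared.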

Furthermore, this assay is useful to confirm the 101P3A11 inhibitory effect of candidate therapeutic
compositions, such as, for example, 101P3A11 antibodies or intrabodies, 101P3A11 antisense molecules or
ribozymes, or 101P3A11-directed small molecules. In Figure 22, we depict the effect of a small molecule, pertussis
toxin (PTX), on tumor formation by 3T3-101P3A11 cells. In this experiment, SCID mice were injected with
3T3-101P3A11 cells alone or in conjunction with PTX. Each mouse was given 5 doses of PTX at 3-4 day intervals.
Tumor volume was evaluated by caliper measurements. Figure 22 shows that PTX inhibits tumor growth in a
dose-dependent manner. In addition to demonstrating that 101P3A11 plays an important role in tumor growth, Figures
21 and 22 identify a signaling pathway associated with 101P3A11 and indicate that 101P3A11 produced its effect
on tumor growth by activating an adenylate cyclase dependent pathway.

Example 33: 101P3A11 Monoclonal Antibody-mediated Inhibition of Tumors In Vivo

The significant expression of 101P3A11 in cancer tissues, together with its restricted expression in normal
tissues, makes 101P3A11 an excellent target for antibody therapy. In cases where the monoclonal antibody target is
a cell surface protein, as is 101P3A11, antibodies have been shown to be efficacious at inhibiting tumor growth
(See, e.g., Saffran, D., et al., PNAS 10:1073-1078 or www.pnas.org/cgi/doi/10.1073/pnas.051624698). In cases
where the target is not on the cell surface, such as PSA and PAP in prostate cancer, antibodies have also been
shown to recognize and inhibit growth of cells expressing those proteins (Saffran, D.C., et al., Cancer and
Metastasis Reviews, 1999. 18: 437-449). As with any cellular protein with a restricted expression profile,
101P3A11 is a target for T cell-based immunotherapy.

Accordingly, the therapeutic efficacy of anti-101P3A11 mAbs in human colon, kidney, bladder and
prostate cancer mouse models is modeled in 101P3A11-expressing kidney, colon, bladder or prostate cancer
xenografts or cancer cell lines, such as those described in the Example entitled "In Vivo Assay for 101P3A11
Tumor Growth Promotion", that have been engineered to express 101P3A11.

Antibody efficacy on tumor growth and metastasis formation is confirmed, e.g., in a mouse orthotopic
prostate, colon, bladder or kidney cancer xenograft model. The antibodies can be unconjugated, or can be
conjugated to a therapeutic modality, as appreciated in the art. It is confirmed that anti-101P3A11 mAbs inhibit
formation of 101P3A11-expressing kidney, colon, bladder and prostate tumors. Anti-101P3A11 mAbs also retard
the growth of established orthotopic tumors and prolong survival of tumor-bearing mice. These results indicate the
utility of anti-101P3A11 mAbs in the treatment of local and advanced stages of cancer. (See, e.g., Saffran, D., et
al., PNAS 10:1073-1078 or www.pnas.org/cgi/doi/10.1073/pnas.051624698)

Administration of anti-101P3A11 mAbs retards established orthotopic tumor growth and inhibits metastasis
to distant sites, resulting in a significant prolongation in the survival of tumor-bearing mice. These studies indicate
that 101P3A11 is an attractive target for immunotherapy and demonstrate the therapeutic potential of
anti-101P3A11 mAbs for the treatment of local and metastatic kidney, colon, bladder and prostate cancer. Similar
studies indicate that 101P3A11 is safe and effective when used in combination with other therapeutic modalities
such as surgery, radiation therapy, hormone therapy or chemotherapy.

This example demonstrates that unconjugated 101P3A11 monoclonal antibodies effectively inhibit the
growth of human bladder tumors grown in SCID mice; accordingly, a combination of such efficacious monoclonal
antibodies is also effective.

Example 34: Induction Of Specific CTL Response In Humans 

A human clinical trial for an immunogenic composition comprising CTL and HTL epitopes of the
invention is set up as an IND Phase I, dose escalation study and carried out as a randomized, double-blind,
placebo-controlled trial. Such a trial is designed, for example, as follows:

A total of about 27 individuals are enrolled and divided into 3 groups: 

Group I: 3 subjects are injected with placebo and 6 subjects are injected with 5 µg of peptide composition;
Group II: 3 subjects are injected with placebo and 6 subjects are injected with 50 µg of peptide composition;
Group III: 3 subjects are injected with placebo and 6 subjects are injected with 500 µg of peptide
composition.

After 4 weeks following the first injection, all subjects receive a booster inoculation at the same dosage. 
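
The group structure described above (three dose groups, each with 3 placebo and 6 treated subjects) can be represented, purely for illustration, by the following randomized allocation sketch. The function name, the seeding, and the use of simple shuffling are assumptions; an actual trial would follow its approved randomization and blinding procedures.

```python
import random

# Dose per group in micrograms, as listed in the trial design above.
DOSES_UG = {"I": 5, "II": 50, "III": 500}
PLACEBO_PER_GROUP = 3
TREATED_PER_GROUP = 6


def allocate(subject_ids, seed=0):
    """Randomly assign 27 subjects to (group, arm, dose_ug); hypothetical helper."""
    ids = list(subject_ids)
    group_size = PLACEBO_PER_GROUP + TREATED_PER_GROUP
    if len(ids) != group_size * len(DOSES_UG):
        raise ValueError("expected exactly 27 subjects")
    random.Random(seed).shuffle(ids)
    allocation = {}
    for index, (group, dose) in enumerate(DOSES_UG.items()):
        block = ids[index * group_size:(index + 1) * group_size]
        for subject in block[:PLACEBO_PER_GROUP]:
            allocation[subject] = (group, "placebo", 0)
        for subject in block[PLACEBO_PER_GROUP:]:
            allocation[subject] = (group, "peptide", dose)
    return allocation


allocation = allocate(range(1, 28))
```

Each subject's booster at 4 weeks uses the same arm and dose recorded in the allocation.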

The endpoints measured in this study relate to the safety and tolerability of the peptide composition as well
as its immunogenicity. Cellular immune responses to the peptide composition are an index of the intrinsic activity
of the peptide composition, and can therefore be viewed as a measure of biological efficacy. The following
summarize the clinical and laboratory data that relate to safety and efficacy endpoints.

Safety: The incidence of adverse events is monitored in the placebo and drug treatment group and
assessed in terms of degree and reversibility.

Evaluation of Vaccine Efficacy: For evaluation of vaccine efficacy, subjects are bled before and after
injection. Peripheral blood mononuclear cells are isolated from fresh heparinized blood by Ficoll-Hypaque density
gradient centrifugation, aliquoted in freezing media and stored frozen. Samples are assayed for CTL and HTL
activity.

The vaccine is found to be both safe and efficacious. 


Example 35: Splice Variants of 101P3A11 

Splice variants are also called alternative transcripts. When a gene is transcribed from genomic DNA, the
initial RNA is generally spliced to produce functional mRNA, which has only exons and is used for translation into
an amino acid sequence. Accordingly, a given gene can have zero to many alternatively spliced mRNA products.
Alternative transcripts each have a unique exon makeup, and can have different coding and/or non-coding (5' or 3'
end) portions from the original transcript. Alternative transcripts can code for similar proteins with the same or a
similar function or may encode proteins with different functions, and may be expressed in the same tissue at the
same time, or in different tissues at different times. Proteins encoded by alternative transcripts can have similar or
different cellular or extracellular localizations, e.g., be secreted.

Splice variants are identified by a variety of art-accepted methods. For example, splice variants are
identified by use of EST data. First, all human ESTs were grouped into clusters which show direct or indirect
identity with each other. Second, ESTs in the same cluster were further grouped into sub-clusters and assembled
into a consensus sequence. The starting gene is compared to the consensus sequence(s). Each consensus sequence
is a potential splice variant for that gene (see, e.g.,
http://www.doubletwist.com/products/cl 1 agentsOverview.ihtml). Even when a variant is identified that is not a
full-length clone, that portion of the variant is very useful for antigen generation and for further cloning of the
full-length splice variant, using techniques known in the art.
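
The first step, grouping ESTs that show direct or indirect identity with each other, amounts to single-linkage clustering and can be sketched with a union-find structure as shown below. The function name and the EST identifiers are hypothetical, and how pairwise identity is scored is left outside the sketch; the commercial pipeline referenced above may differ.

```python
def cluster_ests(est_ids, identity_pairs):
    """Group ESTs connected by direct or indirect pairwise identity (union-find)."""
    parent = {est: est for est in est_ids}

    def find(est):
        # Follow parent links to the cluster root, compressing the path as we go.
        while parent[est] != est:
            parent[est] = parent[parent[est]]
            est = parent[est]
        return est

    for a, b in identity_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    clusters = {}
    for est in est_ids:
        clusters.setdefault(find(est), set()).add(est)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical identifiers: est1-est2 and est2-est3 match directly,
# so est1 and est3 are linked indirectly and fall in the same cluster.
ests = ["est1", "est2", "est3", "est4", "est5"]
pairs = [("est1", "est2"), ("est2", "est3"), ("est4", "est5")]
clusters = cluster_ests(ests, pairs)
```

Each resulting cluster would then be sub-clustered and assembled into consensus sequences for comparison with the starting gene.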

Moreover, computer programs are available in the art that identify splice variants based on genomic
sequences. Genomic-based variant identification programs include FgenesH (A. Salamov and V. Solovyev, "Ab
initio gene finding in Drosophila genomic DNA," Genome Research. 2000 April;10(4):516-22); Grail
(http://compbio.ornl.gov/Grail-bin/EmptyGrailForm) and GenScan (http://genes.mit.edu/GENSCAN.html). For a
general discussion of splice variant identification protocols see, e.g., Southan, C., "A genomic perspective on
human proteases," FEBS Lett. 2001 Jun 8;498(2-3):214-8; de Souza, SJ, et al., "Identification of human
chromosome 22 transcribed sequences with ORF expressed sequence tags," Proc. Natl Acad Sci USA. 2000 Nov
7;97(23):12690-3.

For variants identified by the EST-based method, Table XXI shows the nucleotide sequences of the splice
variants. Table XXII shows the alignment of the splice variant with the 101P3A11 nucleic acid sequence. Table
XXIII displays the single longest alignment of an amino acid sequence encoded by a splice variant, out of all six
potential reading frames, with 101P3A11. Thus, for each splice variant, a variant's reading frame that encodes the
longest single contiguous peptide homology between 101P3A11 and the variant is the proper reading frame
orientation for the variant. Due to the possibility of sequencing errors in EST or genomic data, other peptides in the
relevant reading frame orientation (5' to 3' or 3' to 5') can also be encoded by the variant. Table XXIV lays out all
three frame-shifted amino acid translations of the splice variant for the identified reading frame orientation. Tables
XXI through XXIV are set forth herein on a variant-by-variant basis.

For variants identified by any one of the genomic sequence-based methods, Table XXI shows the
nucleotide sequences of the splice variant. Table XXII shows the alignment of the splice variant with the
101P3A11 nucleic acid sequence. Table XXIII displays the alignment of amino acid sequence of the predicted
transcript(s) with 101P3A11. The genomic-based computer programs predict a transcript from genomic sequence,
and not only predict exons but also set open reading frame as the first forward open reading frame. The predicted
transcript does not contain 5' or 3' untranslated region (UTR). It starts with ATG and ends with a stop codon, TAG,
TGA or TAA. In case the transcript is predicted on the reverse strand of the genomic sequence, the sequence of the
transcript is reverse-complemented to the genomic sequence of the exons. Thus, the genomic-based programs
provide the correct transcript sequence, with 5' to 3' orientation and +1 as the open reading frame. However, due to
the possibility of inaccurate prediction of exons or the possibility of sequencing errors in genomic data, other
peptides in other forward open reading frames can also be encoded by the variant. Table XXIV lays out all amino
acid translations of the splice variant in each of the three forward reading frames. Tables XXI through XXIV are
set forth herein on a variant-by-variant basis.
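
The translation of a transcript in each of the three forward reading frames, and the reverse-complementing of a transcript predicted on the reverse strand, can be sketched as follows. The sketch uses the standard genetic code; the function names and the short test sequence are illustrative assumptions and do not reproduce any entry of Tables XXI through XXIV.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, as needed for reverse-strand predictions."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq: str, frame: int) -> str:
    """Translate seq in forward frame 0, 1 or 2; '*' marks a stop, 'X' an unknown codon."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translations(seq: str):
    """Return the translations for the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


frames = three_frame_translations("ATGGCCTAA")  # illustrative 9-nucleotide sequence
```

Applying `three_frame_translations` to `reverse_complement(seq)` gives the three reverse-strand frames, which together with the forward frames cover all six potential reading frames.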

To further confirm the parameters of a splice variant, a variety of techniques available in the art are used,
such as full-length cloning, proteomic validation, PCR-based validation, and 5' RACE validation, etc. (see e.g.,
Proteomic Validation: Brennan SO, Fellowes AP, George PM.; "Albumin banks peninsula: a new termination
variant characterized by electrospray mass spectrometry." Biochim Biophys Acta. 1999 Aug 17;1433(1-2):321-6;
Ferranti P, et al., "Differential splicing of pre-messenger RNA produces multiple forms of mature caprine
alpha(s1)-casein." Eur J Biochem. 1997 Oct 1;249(1):1-7; PCR-based Validation: Wellmann S, et al., "Specific
reverse transcription-PCR quantification of vascular endothelial growth factor (VEGF) splice variants by
LightCycler technology." Clin Chem. 2001 Apr;47(4):654-60; Jia HP, et al., "Discovery of new human
beta-defensins using a genomics-based approach," Gene. 2001 Jan 24;263(1-2):211-8; PCR-based and 5' RACE
Validation: Brigle I.E., et al., "Organization of the murine reduced folate carrier gene and identification of variant
splice forms," Biochim Biophys Acta. 1997 Aug 7;1353(2):191-8).

It is known in the art that genomic regions are modulated in cancers. When the genomic region to which
101P3A11 maps is modulated in a particular cancer, the splice variants of 101P3A11 are modulated as well.
Disclosed herein is that 101P3A11 has a particular expression profile. Splice variants of 101P3A11 that are
structurally and/or functionally similar to 101P3A11 share this expression pattern, thus serving as tumor-associated
markers/antigens.

Using the EST assembly approach, we identified one splice variant. 

Example 36: Splice Variant Protein Characteristics

The present variant protein set forth in Table XXI is understood to be partial, and thus to comprise
domains of the full protein. Nucleotides 1-217 of the variant align with nucleotides 66-290 of 101P3A11. Similarly,
amino acids 3-155 of the 101P3A11 variant protein align with amino acids 70-222 of 101P3A11 with 100%
identity, while the remaining downstream amino acids diverge from 101P3A11. This pattern of high homology
to one section of the parent protein coupled to a high divergence from the remaining portions of the parent protein
forms the hallmark of a splice variant.

Protein BLAST analysis of the 101P3A11 variant shows that the 101P3A11 variant is homologous to the human
olfactory receptor 52D1 (gi 14423837) with 35% identity over 68 amino acids located towards the amino terminus
of 101P3A11 variant 1. 101P3A11 variant 1 shows homology to rat olfactory receptor RA1c (gi 3420759) with
53% identity over 32 amino acids. Analysis by Pfam or Prosite did not identify any motifs.

Based on TMPred bioinformatic analysis, the 101P3A11 variant appears to contain one transmembrane
domain. In contrast, Sosui predicts no transmembrane domain but suggests that 101P3A11 variant 1 forms a
soluble intracellular protein. Due to its homology to 101P3A11 and olfactory receptors, the 101P3A11 variant 1 is
associated with cancer and signaling.

Example 37: Identification of 101P3A11 sequences involved in ligand binding. 

As shown in Figure 4, the transmembrane regions of 101P3A11 and mouse olfactory receptor S25
(ORS25) predicted using the TMHMM algorithm are highlighted in gray. The amino acids of ORS25 predicted by
Floriano, et al. to be involved in binding of the ligand hexanol and/or involved in the formation of the ligand
binding pocket are italicized and bolded in Figure 4, and are listed below. (Floriano, W.B., et al., 2000, Proc. Natl.
Acad. Sci., USA, 97:10712-10716)


Leu 131        Ala 230
Val 134        Ile 231
Val 135        Gly 234
Gly 138        Thr 284
Thr 139        Phe 287
Ser 193        Gln 300
Ser 197        Lys 302
Phe 225



Sequences of 101P3A11 involved in ligand binding are identified based on homology to mouse olfactory
receptor S25. Shown is the amino acid alignment of 101P3A11 with mouse olfactory receptor S25 depicting the
predicted transmembrane domains of each GPCR. The amino acids of S25 involved in the recognition and binding
of its ligand hexanol or that lie in the proximity of the binding pocket (Floriano, W.B., et al., 2000, Proc. Natl. Acad.
Sci., USA, 97:10712-10716), are also shown. These amino acids lie close to or within the transmembrane domains
of ORS25. Accordingly, the structurally homologous regions of 101P3A11 are involved in the binding of its
cognate ligand. These regions encode the amino acids of the first extracellular loop and of the amino terminal end
of transmembrane domain 3 (amino acids 82-112), the amino acids at the carboxyl terminal end of transmembrane
domain 4 and into the second extracellular loop (amino acids 160-185), the amino acids at the end of the second
extracellular loop and into transmembrane domain 5 (amino acids 186-212), and the amino acids at the carboxyl
terminal end of transmembrane domain 6, the third extracellular loop, and the amino terminal end of
transmembrane domain 7 (amino acids 250-280). Thus, ligands of 101P3A11 are identified that interact with at
least 3 of the following regions of 101P3A11: amino acids 82-112, amino acids 160-185, amino acids 186-212,
and amino acids 250-280.
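
The "at least 3 of the following regions" criterion can be expressed, for illustration, as a simple check over the residue positions that a candidate ligand is found to contact. The region boundaries come from the text above; the region labels, function names, and example contact lists are hypothetical, and how contacts are determined experimentally or computationally is outside the sketch.

```python
# Residue ranges (inclusive) from the text; the labels are shorthand added here.
LIGAND_BINDING_REGIONS = {
    "ECL1/TM3": (82, 112),
    "TM4/ECL2": (160, 185),
    "ECL2/TM5": (186, 212),
    "TM6/ECL3/TM7": (250, 280),
}


def regions_contacted(contact_residues):
    """Return the set of region labels containing at least one contacted residue."""
    return {
        label
        for label, (start, end) in LIGAND_BINDING_REGIONS.items()
        if any(start <= position <= end for position in contact_residues)
    }


def meets_region_criterion(contact_residues, minimum=3):
    """True if the contacts span at least `minimum` of the four regions."""
    return len(regions_contacted(contact_residues)) >= minimum


# Hypothetical contact lists for two candidate ligands.
candidate_a = [90, 170, 200]   # touches three regions
candidate_b = [90, 100, 300]   # touches only one region
```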

Example 38: Homology Comparison of 101P3A11 to Known Sequences 

The 101P3A11 protein of Figure 3 has 317 amino acids with calculated molecular weight of 35.2 kDa, and
pI of 8.7. 101P3A11 is predicted to be a cell surface protein. Cellular localization was demonstrated by FACS
analysis and immunofluorescence in cells engineered to express 101P3A11.

101P3A11 shows best homology to rat olfactory receptor RA1c (gi 3420759,
http://www.ncbi.nlm.nih.gov), sharing 59% identity and 76% homology with that protein. 101P3A11 also shows
homology to human prostate specific GPCR (gi 13540539) and human olfactory receptor 51112 (gi 14423836),
sharing 59% identities/77% homology, and 53% identities/69% homology with each, respectively (Figures 23-25).
Sequence and motif analysis indicate that 101P3A11 belongs to the family of olfactory receptors.
Bioinformatic analysis revealed 101P3A11 to be a 7 transmembrane protein, with strong domain and structural
homology to G-protein coupled receptors (GPCRs) (see Table XXV, TM Pred, Sosui, Pfam, Blocks, Print).
Proteins that are members of the G-protein coupled receptor family exhibit an extracellular amino-terminus, three
extracellular loops, three intracellular loops and an intracellular carboxyl terminus. G-protein coupled receptors are
seven-transmembrane receptors that are stimulated by polypeptide hormones, neurotransmitters, chemokines and
phospholipids (Civelli O et al., Trends Neurosci. 2001, 24:230; Vrecl M et al., Mol Endocrinol. 1998, 12:1818).
Ligand binding traditionally occurs between the first and second extracellular loops of the GPCR. Upon ligand
binding GPCRs transduce signals across the cell surface membrane by associating with trimeric G proteins. Their
signals are transmitted via trimeric guanine-nucleotide binding proteins (G proteins) to effector enzymes or ion
channels (Simon et al., 1991, Science 252: 802). Signal transduction and biological output mediated by GPCR can
be modulated through various mechanisms including peptide mimics, small molecule inhibitors and GPCR kinases
or GRK (Pitcher JA et al., J Biol Chem. 1999, 3;274:34531; Fawzi AB, et al. 2001, Mol. Pharmacol., 59:30).

Recently, GPCRs have also been shown to link to mitogenic signaling pathways of tyrosine kinases
(Luttrell et al., 1999, Science 283: 655; Luttrell et al., 1999 Curr Opin Cell Biol 11: 177). GPCRs are regulated by
phosphorylation mediated by GPCR kinases (GRKs), which themselves are indirectly activated by the GPCRs
(Pitcher et al., 1998, Ann. Rev. Biochem. 67:653). Olfactory GPCRs transmit their signals by activating the
cAMP pathway via adenylate cyclase resulting in downstream signaling to protein kinase A, and by activating the
phospholipase C pathway by generating inositol 1,4,5-trisphosphate (IP3) and diacyl-glycerol (DAG) (Breer, 1993,
Ciba Found Symp 179: 97; Bruch, 1996, Comp Biochem Physiol B Biochem Mol Biol 113:451). IP3 results in an
increase in intracellular calcium, while DAG activates protein kinase C.

Recent studies have associated GPCRs with cellular transformation. In particular, KSHV G
protein-coupled receptor was found to transform NIH 3T3 cells in vitro and to induce multifocal KS-like lesions in
KSHV-GPCR-transgenic mice (Schwarz M, Murphy PM. J Immunol 2001, 167:505). KSHV-GPCR was capable of
producing its effect on endothelial cells and fibroblasts by activating defined signaling pathways, including the
AKT survival pathway (Montaner S et al., Cancer Res 2001, 61:2641). In addition, KSHV-GPCR induced the
activation of mitogenic pathways such as AP-1 and NFkB, resulting in the expression of pro-inflammatory genes
(Schwarz M, Murphy PM. J Immunol 2001, 167:505). Other GPCRs associated with tumor formation include G2A
and PAR-1, which has been found to induce transformation of NIH 3T3 cells (Whitehead I et al., Oncogene
2001, 20:1547).

This information indicates that 101P3A11 plays a role in the transformation of mammalian cells, induces
mitogenic responses including activation of various signaling pathways, and regulates gene transcription by
transmitting cell surface signals to the nucleus. See also the Example entitled "In Vivo Assay for 101P3A11 Tumor
Growth Promotion".

Accordingly, when 101P3A11 functions as a regulator of cell transformation, tumor formation, or as a
modulator of transcription involved in activating genes associated with inflammation, tumorigenesis or
proliferation, 101P3A11 is used for therapeutic, diagnostic, prognostic and/or preventative purposes, in manners
analogous to or that track other GPCRs as discussed herein and in the art.

Example 39: Identification and Confirmation of Potential Signal Transduction Pathways 

Many mammalian proteins have been reported to interact with signaling molecules and to participate in
regulating signaling pathways (J Neurochem. 2001; 76:217-223). In particular, GPCRs have been reported to
activate MAPK cascades as well as G proteins, and have been associated with the EGFR pathway in epithelial cells
(Naor, Z., et al., Trends Endocrinol Metab. 2000, 11:91; Vacca F et al., Cancer Res. 2000, 60:5310; Della Rocca
G.J., et al., J Biol Chem. 1999, 274:13978). Using immunoprecipitation and Western blotting techniques, proteins are
identified that associate with 101P3A11 and mediate signaling events. Several pathways known to play a role in
cancer biology can be regulated by 101P3A11, including phospholipid pathways such as PI3K, AKT, etc., adhesion
and migration pathways, including FAK, Rho, Rac-1, etc., as well as mitogenic/survival cascades such as ERK, p38,
etc. (Cell Growth Differ. 2000, 11:279; J Biol Chem. 1999, 274:801; Oncogene. 2000, 19:3003; J. Cell Biol. 1997,
138:913).

Using Western blotting and other techniques, the ability of 101P3A11 to regulate these pathways is
confirmed. Cells expressing or lacking 101P3A11 are either left untreated or stimulated with cytokines, androgen
and anti-integrin antibodies. Cell lysates were analyzed using anti-phospho-specific antibodies (Cell Signaling,
Santa Cruz Biotechnology) in order to detect phosphorylation and regulation of ERK, p38, AKT, PI3K, PLC and
other signaling molecules. Using such techniques, we showed that 101P3A11 alters the tyrosine phosphorylation
pattern of NIH 3T3 cells (Figure 26), indicating that 101P3A11 is regulating protein kinases and phosphatases. In
the experiment shown in Figure 26, control 3T3-neo and 3T3-101P3A11 cells were treated with either 0.5% or 10%
FBS and whole cell lysates were analyzed by anti-phosphotyrosine Western blotting. Expression of 101P3A11
resulted in reduced phosphorylation of several proteins in NIH-3T3 cells, while inducing the phosphorylation of
proteins at 79-81 and 28-32 kDa.

Using anti-Phospho-ERK antibodies, we demonstrated that expression of 101P3A11 induced ERK
phosphorylation in the prostate cancer cell line PC3 (Figure 27A and Figure 27B), and that ERK phosphorylation
in 101P3A11-expressing cells was regulated by GPCR ligands. In this experiment, control PC3-neo cells and
PC3-101P3A11 cells were left untreated (0.1% FBS) or were stimulated with 10% FBS, lysophosphatidic acid (LPA),
gastrin releasing peptide (GRP), leukotriene (LKB4) or platelet activating factor (PAF). The cells were lysed and
analyzed by Western blotting using anti-Phospho-ERK (Figure 27A) or anti-ERK (Figure 27B) mAb. The results
showed that expression of 101P3A11 mediated significant ERK phosphorylation by FBS, LPA, GRP and PAF,
while LKB4 resulted in a more modest level of ERK phosphorylation in PC3-101P3A11 cells. In contrast, none of
the GPCR ligands induced significant ERK phosphorylation in PC3-neo cells, demonstrating the specificity of
GPCR ligand-mediated responses in 101P3A11-expressing cells. The ERK overlay demonstrated equal loading,
supporting the specificity of these data.

In order to delineate the signaling pathway by which 101P3A11 mediates ERK phosphorylation in cancer cells, it
was determined which of two pathway inhibitors, the MEK inhibitor PD98059 or the p38 inhibitor SB203580,
regulates 101P3A11-mediated ERK phosphorylation (Figure 28). To obtain these data, PC3-neo and PC3-101P3A11
cells were treated with media alone or in the presence of PD98059, SB203580, or genistein, and were stimulated
with FBS or GRP. Cells were lysed and analyzed by Western blotting using anti-Phospho-ERK or anti-ERK mAb.
Treatment with 10% FBS or with GRP induced the phosphorylation of ERK in PC3-101P3A11 but not in control
PC3-neo cells. 101P3A11-mediated ERK phosphorylation was inhibited by the MEK-1 inhibitor PD98059 but not by
the p38 inhibitor SB203580 or genistein. The ERK overlay demonstrated equal loading, supporting the specificity of
the results. These results were confirmed by those obtained in two additional sets of experiments. The inhibition of
101P3A11-mediated ERK phosphorylation by PD98059 demonstrates that 101P3A11 activated the classical MEK-ERK
cascade, a pathway associated with mitogenesis, proliferation and tumorigenesis.

Results in Figures 26-28 indicate that 101P3A11 regulates the activity of kinases, including ERK, and
phosphatases. In order to confirm the association of 101P3A11 with phosphatase activity, the effect of the protein
phosphatase inhibitor sodium orthovanadate on 101P3A11-mediated ERK phosphorylation was determined (Figure
29). PC3-neo and PC3-101P3A11 cells were grown in media alone or in the presence of sodium orthovanadate
(Na3VO4), and were stimulated with 0.1% or 10% FBS. Cells were lysed and analyzed by Western blotting using
anti-Phospho-ERK or anti-ERK mAb. Treatment with Na3VO4 resulted in a 4.5-fold increase in ERK
phosphorylation in PC3-101P3A11 cells, compared to a two-fold increase in PC3-neo cells. Results in Figure 29
confirm the contribution of protein phosphatases to 101P3A11-mediated signaling.
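
Fold increases of this kind are typically derived from band intensities by normalizing the phospho-ERK signal to the total ERK signal in the same lane and then taking the ratio of treated to untreated. The sketch below illustrates that arithmetic only; the function names and densitometry values are invented placeholders chosen to reproduce a 4.5-fold and a two-fold ratio, and the disclosure does not describe how its values were quantified.

```python
def normalized_signal(phospho_intensity: float, total_intensity: float) -> float:
    """Phospho-ERK band intensity normalized to total ERK (loading control)."""
    if total_intensity <= 0:
        raise ValueError("total ERK intensity must be positive")
    return phospho_intensity / total_intensity


def fold_change(treated, untreated) -> float:
    """Ratio of normalized signals; each argument is a (phospho, total) pair."""
    return normalized_signal(*treated) / normalized_signal(*untreated)


# Invented densitometry values (arbitrary units), for illustration only.
pc3_101p3a11_fold = fold_change(treated=(900.0, 1000.0), untreated=(200.0, 1000.0))
pc3_neo_fold = fold_change(treated=(400.0, 1000.0), untreated=(200.0, 1000.0))
```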

Several GPCRs have been shown to transactivate receptor tyrosine kinases associated with the cell
membrane, such as the EGF receptor (EGFR) (Pierce K.L., et al., J Biol Chem. 2001, 276:23155; Nath, D., et al., J
Cell Sci. 2001, 114:1213). In order to determine whether 101P3A11 signaling results in the activation of EGFR,
we compared the effect of the EGFR inhibitor, AG1517, on EGFR- and 101P3A11-mediated ERK phosphorylation
(Figure 30). In Figure 30, PC3-neo and PC3-101P3A11 cells were grown in media alone (0.1% FBS) or in the
presence of AG1517. The cells were stimulated with 0.1% or 10% FBS, GRP or EGF, lysed and analyzed by
Western blotting using anti-Phospho-ERK or anti-ERK mAb. Treatment with 10% FBS, GRP and EGF induced
ERK phosphorylation in PC3-101P3A11 cells. ERK phosphorylation by EGF was completely inhibited by
AG1517. 101P3A11-mediated ERK phosphorylation in cells treated with 10% FBS was partially inhibited by
AG1517. Data in Figure 30 indicate that some cross talk occurred between 101P3A11 and EGFR signaling
pathways.

In addition to activating the ERK cascade, 101P3A11 activated a parallel MAPK pathway, namely p38. In
Figure 31A and Figure 31B, PC3-neo and PC3-101P3A11 cells were grown in 1% or 10% FBS. Cells were lysed
and analyzed by Western blotting using anti-Phospho-p38 (Figure 31A) or anti-p38 (Figure 31B) monoclonal
antibody (mAb). Our results demonstrate that expression of 101P3A11 mediated p38 phosphorylation in cells
treated with 10% FBS. Equal loading was demonstrated in the p38 overlay.

Results shown in Figures 26-30 and Figure 31A-31B confirm that 101P3A11 activates several signaling
pathways in cancer cells, including the ERK and p38 cascades. In addition to MAPK, 101P3A11 signaling was
associated with protein phosphatase activity and EGFR transactivation. These signaling pathways have been
associated with cell growth, survival and transcriptional activation, all of which play an important role in tumor
initiation and progression. When 101P3A11 plays a role in the regulation of signaling pathways, whether
individually or communally, it is used as a target for diagnostic, prognostic, preventative and/or therapeutic
purposes.


To confirm that 101P3A11 directly or indirectly activates known signal transduction pathways in cells,
luciferase (luc) based transcriptional reporter assays are carried out in cells expressing individual genes. These
transcriptional reporters contain consensus-binding sites for known transcription factors that lie downstream of
well-characterized signal transduction pathways. The reporters and examples of these associated transcription
factors, signal transduction pathways, and activation stimuli are listed below.


1. NFkB-luc, NFkB/Rel; Ik-kinase/SAPK; growth/apoptosis/stress
2. SRE-luc, SRF/TCF/ELK1; MAPK/SAPK; growth/differentiation
3. AP-1-luc, FOS/JUN; MAPK/SAPK/PKC; growth/apoptosis/stress
4. ARE-luc, androgen receptor; steroids/MAPK; growth/differentiation/apoptosis
5. p53-luc, p53; SAPK; growth/differentiation/apoptosis
6. CRE-luc, CREB/ATF2; PKA/p38; growth/apoptosis/stress


Gene-mediated effects can be assayed in cells showing mRNA expression. Luciferase reporter plasmids
can be introduced by lipid-mediated transfection (TFX-50, Promega). Luciferase activity, an indicator of relative
transcriptional activity, is measured by incubating cell extracts with luciferin substrate and monitoring the
luminescence of the reaction in a luminometer.
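
The reporter list and luminometer readout described above can be organized as a lookup table plus a
fold-activation calculation. The following is a minimal Python sketch, not the procedure used in these Examples:
the dictionary transcribes the six reporters listed above, while the function name `fold_activation` and any
luminescence readings passed to it are hypothetical.

```python
from statistics import mean

# Reporter -> (transcription factors, pathways, activation stimuli),
# transcribed from the numbered list above.
REPORTERS = {
    "NFkB-luc": ("NFkB/Rel", "Ik-kinase/SAPK", "growth/apoptosis/stress"),
    "SRE-luc": ("SRF/TCF/ELK1", "MAPK/SAPK", "growth/differentiation"),
    "AP-1-luc": ("FOS/JUN", "MAPK/SAPK/PKC", "growth/apoptosis/stress"),
    "ARE-luc": ("androgen receptor", "steroids/MAPK",
                "growth/differentiation/apoptosis"),
    "p53-luc": ("p53", "SAPK", "growth/differentiation/apoptosis"),
    "CRE-luc": ("CREB/ATF2", "PKA/p38", "growth/apoptosis/stress"),
}


def fold_activation(test_readings, control_readings):
    """Mean luminescence of test extracts divided by that of control extracts.

    Hypothetical helper: both arguments are replicate luminometer readings
    (relative light units) for the same reporter.
    """
    control = mean(control_readings)
    if control <= 0:
        raise ValueError("control luminescence must be positive")
    return mean(test_readings) / control
```

With made-up readings, `fold_activation([300, 500], [100, 100])` returns 4.0, i.e. a four-fold higher reporter
signal in the test cells than in the controls.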

Signaling pathways activated by 101P3A11 are mapped and used for the identification and validation of
therapeutic targets. When 101P3A11 is involved in cell signaling, it is used as a target for diagnostic, prognostic,
preventative and/or therapeutic purposes.


Example 40: 101P3A11 Functions as a GPCR 

Sequence and homology analysis of 101P3A11 indicated that 101P3A11 is a member of the olfactory
receptor family of GPCRs. Olfactory receptors are known to regulate biological responses by activating adenylate
cyclase. In order to confirm that 101P3A11 functions as a GPCR and mediates the activation of adenylate cyclase,
cAMP accumulation in PC3 and PC3-101P3A11 cells was compared (Figure 32). Control PC3 and PC3-101P3A11
cells were grown in a low concentration of fetal bovine serum (FBS) for 14 hrs in the presence or absence of pertussis
toxin (PTX). The cells were stimulated with 0.1% or 10% FBS, washed in PBS and lysed using a lysis buffer provided
by Amersham Pharmacia. Intracellular concentration of cAMP was measured using a commercially available enzyme
immunoassay (EIA) according to the manufacturer's recommendations (Amersham Pharmacia). Each assay was
performed in duplicate. Calculations of cAMP concentrations were based on OD450 of the standard curve. Expression
of 101P3A11 induced a four-fold increase in cAMP accumulation in the absence of stimulation. Treatment with
10% FBS further enhanced cAMP accumulation in PC3-101P3A11 cells to nearly seven-fold over control PC3-neo
cells. 101P3A11-mediated cAMP accumulation was inhibited by PTX. These results were confirmed by two
separate sets of experiments. Results shown in Figure 32 demonstrate that 101P3A11 functions as a GPCR in
prostate cancer cells and exhibits classical GPCR characteristics, such as cAMP accumulation that is inhibited by
PTX.
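
The step of reading cAMP concentrations off an OD450 standard curve can be illustrated as follows. This is a
sketch under stated assumptions, not the kit manufacturer's procedure: it interpolates linearly in
log10(concentration) between neighbouring standards, and the function names and all numeric values used with
them are hypothetical.

```python
import math


def mean_of_duplicates(readings):
    """Average duplicate OD450 readings of one sample."""
    return sum(readings) / len(readings)


def camp_from_od(od, standards):
    """Interpolate a cAMP concentration from an OD450 reading.

    standards: (concentration, od450) pairs from the standard curve.
    Interpolation is linear in log10(concentration) between the two
    standards whose OD values bracket the reading (an assumption; a kit
    may prescribe a different curve fit).
    """
    points = sorted(standards, key=lambda p: p[1])  # ascending OD
    if not points[0][1] <= od <= points[-1][1]:
        raise ValueError("reading lies outside the standard curve")
    for (conc_a, od_a), (conc_b, od_b) in zip(points, points[1:]):
        if od_a <= od <= od_b and od_b > od_a:
            fraction = (od - od_a) / (od_b - od_a)
            log_conc = math.log10(conc_a) + fraction * (
                math.log10(conc_b) - math.log10(conc_a)
            )
            return 10 ** log_conc
    raise ValueError("need at least two standards with distinct OD values")
```

Because the standards are sorted by OD before interpolation, the helper does not depend on whether the signal
rises or falls with cAMP concentration. Readings outside the range of the standards are rejected rather than
extrapolated.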
Since adenylate cyclase activity modulates intracellular levels of cAMP and induces downstream signaling
events such as activation of protein kinase A, calcium and ERK MAPK signaling (Pierce K.L., et al, Oncogene.
2001, 20:1532), we determined that PTX, an inhibitor of adenylate cyclase signaling, prevents 101P3A11-mediated
ERK phosphorylation along with inhibiting cAMP accumulation (Figure 33 and Figure 34). PC3-neo and
PC3-101P3A11 cells were grown overnight in 0.1% FBS in media alone or in the presence of pertussis toxin (PTX).
Cells were stimulated with 0.1% or 10% FBS (Figure 33) or 10% FBS, EGF or GRP (Figure 34). Cells were lysed
and analyzed by Western blotting using anti-Phospho-ERK mAb. Expression of 101P3A11 mediated ERK
phosphorylation by 10% FBS in PC3 cells, which was inhibited by PTX (Figure 33 and Figure 34). In contrast,
GRP and EGF-mediated ERK phosphorylation was relatively unaffected by PTX (Figure 34), demonstrating the
specificity of 101P3A11-mediated responses. These results were replicated in additional experiments.

GPCRs transmit their signal by activating trimeric G proteins. Once GPCRs are activated, the associated
Gα subunit binds GTP, dissociates from the receptor and participates in downstream signaling events (Schild, D.,
and Restrepo, D. Physiol Rev. 1998, 78:429-66). In order to determine whether inhibition of Gα subunits has an effect
on 101P3A11-mediated cell growth, the effect of two Gα inhibitors on the proliferation of 3T3-101P3A11 cells was
investigated. Control 3T3 and 3T3-101P3A11 cells were grown in the presence or absence of suramin or its
derivative NF449 (Sigma). Cells were analyzed for proliferation 72 hours later (Figure 35). The experiment was
performed in triplicate. The data showed that suramin and NF449 inhibited the proliferation of 3T3-101P3A11
cells by 60% and 80%, respectively. This response was 101P3A11 specific, as suramin and NF449 had no effect on
the proliferation of control 3T3 cells. Thus, as 101P3A11 is involved in GPCR activity, it is used as a target for
diagnostic, prognostic, preventative and/or therapeutic purposes.
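
The percent-inhibition figures quoted above follow from comparing treated and untreated triplicates. The sketch
below shows one conventional way to compute such a value. The function name and the readings used with it are
hypothetical, and the source does not state which proliferation readout or formula was used.

```python
from statistics import mean


def percent_inhibition(untreated, treated):
    """Percent reduction of the mean proliferation signal caused by treatment.

    untreated, treated: replicate proliferation readings (for example,
    triplicates) for the same cell line without and with inhibitor.
    """
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated signal must be positive")
    return 100.0 * (1.0 - mean(treated) / baseline)
```

A value near 0 for control cells and a large value for 101P3A11-expressing cells would correspond to the
receptor-specific response described above.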

GPCRs can be activated by a variety of ligands, including hormones, neuropeptides, chemokines, odorants
and phospholipids. In the case of olfactory receptors, individual olfactory receptors may recognize multiple
odorants, and can be activated by a diverse array of molecules. These ligands and molecules recognized by a
receptor (as described above) are small molecules as described herein.

In order to identify 101P3A11 (small molecule) ligand(s), the possibility that epithelial cells may be
secreting 101P3A11 activators was investigated (Figure 36A and 36B). Prostate cancer epithelial cells (PC3,
PC3-101P3A11, LAPC4 2 hT), normal prostate cells (PrEC), fibroblasts (3T3, 3T3-101P3A11), and human kidney
epithelial cells (293T) were grown in the presence or absence of FBS. Cell supernatants were collected and used to
stimulate PC3 and PC3-101P3A11 cells. Resting and supernatant-treated PC3 and PC3-101P3A11
cells were lysed and analyzed by Western blotting with anti-Phospho-ERK (Figure 36A) and anti-ERK (Figure
36B) mAb. As shown in Figure 36A and Figure 36B, supernatants from normal prostate cells, PrEC, and prostate
cancer cells, PC3, PC3-101P3A11 and LAPC4 2 hT, induced the phosphorylation of ERK in PC3-101P3A11 but not
control PC3 cells. In contrast, no specific ERK phosphorylation was observed using supernatants from 3T3 or 293T
cells. Our results show that prostate cells, grown in the absence of serum, produce one or more factors that
contribute to the activation of 101P3A11-mediated signaling events. Thus, as 101P3A11 responds to stimuli and
functions in signaling and GPCR activity, it is used as a target for diagnostic, prognostic, preventative and/or
therapeutic purposes.


Example 41: Inhibitors of 101P3A11 GPCR Function 

As mentioned in the Example entitled "Homology Comparison of 101P3A11 to Known Sequences,"
GPCRs are activated by ligand binding to the extracellular loops, resulting in the activation of trimeric G proteins
and the initiation of several signaling cascades. Using this information, several therapeutic and small molecule
strategies are utilized to inhibit GPCR activation or downstream signaling events.

One strategy inhibits receptor and ligand binding. Recent studies using several types of GPCRs have
demonstrated the effectiveness of this strategy (Fawzi AB, et al. 2001, Mol. Pharmacol., 59:30). Using a compound
named SCH-202676, the authors inhibited agonist and antagonist binding to GPCRs by allosterically hindering
ligand-GPCR interaction. Using this and even more specific allosteric (small molecule) inhibitors, signal transduction
through 101P3A11 is inhibited, thereby providing therapeutic, prognostic, diagnostic and/or prophylactic benefit.

A second approach is to inhibit Gα subunit activation. Activation of GPCRs results in the exchange
of GTP for GDP on the Gα subunit of the trimeric G protein. Inhibition of Gα activation prevents the
activation of downstream signaling cascades and therefore the biological effects of the GPCR. One molecule used to
inhibit GDP exchange on Gα subunits is suramin (Freissmuth M et al, 1996, Mol. Pharmacol, 49:602). Since
suramin functions as a universal Gα inhibitor, it prevents the activation of most Gα subunits. Using techniques
described, for example and without limitation, in the present Examples entitled "In Vivo Assay for 101P3A11
Tumor Growth Promotion," "Identification and Confirmation of Potential Signal Transduction Pathways,"
"101P3A11 Functions as a GPCR," and "Regulation of Transcription," small molecules are identified that
selectively inhibit the Gα subunit that associates with 101P3A11, thereby providing therapeutic, prognostic,
diagnostic and/or prophylactic benefit.

A third approach is to inhibit Gα subunit association with the GPCR. In order for trimeric G proteins to be
activated following GPCR/ligand interaction, it is necessary for them to associate with their corresponding GPCR.
Mutational analysis has mapped the interaction of Gα to the first and third intracellular loops of GPCRs (Heller R et
al. 1996, Biochem. Biophys. Res. Commun). Several studies have used synthetic (small molecule) peptides
corresponding to the intracellular sequence of loops 1 and 3 as inhibitors (Mukherjee, S., et al. 1999, J. Biol.
Chem.). Such short peptides, which serve as receptor mimics, are used to compete for binding of Gα
subunits to 101P3A11 and thereby provide therapeutic, prognostic, diagnostic and/or prophylactic benefit.

Thus, compounds and small molecules designed to inhibit 101P3A11 function and downstream signaling
events are used for therapeutic, diagnostic, prognostic and/or preventative purposes.

Example 42: Involvement in Tumor Progression 

The 101P3A11 gene can contribute to the growth of cancer cells. The role of 101P3A11 in tumor growth
is confirmed in a variety of primary and transfected cell lines including prostate, colon, bladder and kidney cell
lines, as well as NIH 3T3 cells engineered to stably express 101P3A11. Parental cells lacking 101P3A11 and cells
expressing 101P3A11 are evaluated for cell growth using a well-documented proliferation assay (Fraser SP, et al.,
Prostate 2000; 44:61; Johnson DE, Ochieng J, Evans SL. Anticancer Drugs. 1996, 7:288). Using such a technique,
we demonstrated (see Figure 37) that 101P3A11 imparts a growth advantage on NIH 3T3 cells. 3T3-neo and
3T3-101P3A11 cells were grown in 0.5% or 10% FBS and analyzed 48 hours later. The assay was performed in
triplicate. Expression of 101P3A11 resulted in a 6-fold increase in proliferation relative to control 3T3 cells grown in
0.5% FBS.

To confirm the role of 101P3A11 in the transformation process, its effect in colony forming assays was
investigated. Parental NIH-3T3 cells lacking 101P3A11 were compared to NIH-3T3 cells expressing 101P3A11,
using a soft agar assay under stringent and more permissive conditions (Song Z. et al. Cancer Res. 2000; 60:6730).
The results are shown in Figure 43, where 101P3A11 induces an over 100-fold increase in colony formation relative to
neo-resistant controls. We previously showed that expression of 101P3A11 in NIH 3T3 cells induces the growth of
these cells in soft agar (129-24usul), indicating that 101P3A11 participates in the process of transformation.

To confirm the role of 101P3A11 in invasion and metastasis of cancer cells, a well-established assay is
used. A non-limiting example is the use of an assay which provides a basement membrane or an analog thereof
used to detect whether cells are invasive (e.g., a Transwell Insert System assay (Becton Dickinson) (Cancer Res.
1999; 59:6010)). Control cells, including prostate, colon, bladder and kidney cell lines lacking 101P3A11, are
compared to cells expressing 101P3A11. Cells are loaded with the fluorescent dye calcein and plated in the top
well of a support structure coated with a basement membrane analog (e.g., the Transwell insert). Invasion is
determined by the fluorescence of cells in the lower chamber relative to the fluorescence of the
entire cell population.
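
The invasion readout described above is a ratio of two fluorescence measurements. As a minimal illustration
(the function name and values are hypothetical, and no background correction is assumed):

```python
def percent_invasion(lower_chamber_fluorescence, total_fluorescence):
    """Fluorescence of invaded cells as a percentage of the whole population."""
    if total_fluorescence <= 0:
        raise ValueError("total fluorescence must be positive")
    return 100.0 * lower_chamber_fluorescence / total_fluorescence
```

Comparing this percentage between control cells and cells expressing 101P3A11 gives the relative invasiveness
that the assay is designed to detect.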

101P3A11 can also play a role in cell cycle and apoptosis. Parental cells and cells expressing 101P3A11
are compared for differences in cell cycle regulation using a well-established BrdU assay (Abdel-Malek ZA. J Cell
Physiol. 1988, 136:247). In short, cells grown under both optimal (full serum) and limiting (low serum)
conditions are labeled with BrdU and stained with anti-BrdU Ab and propidium iodide. Cells are analyzed for entry
into the G1, S, and G2/M phases of the cell cycle. Alternatively, the effect of stress on apoptosis is evaluated in
control parental cells and cells expressing 101P3A11, including normal and tumor prostate, colon and lung cells.
Engineered and parental cells are treated with various chemotherapeutic agents, such as etoposide, flutamide, etc.,
and protein synthesis inhibitors, such as cycloheximide. Cells are stained with annexin V-FITC and cell death is
measured by FACS analysis. The modulation of cell death by 101P3A11 can play a critical role in regulating tumor
progression and tumor load.

When 101P3A11 plays a role in cell growth, transformation, invasion or apoptosis, it is used as a target for
diagnostic, prognostic, preventative and/or therapeutic purposes.

Example 43: Involvement in Angiogenesis

Angiogenesis or new capillary blood vessel formation is necessary for tumor growth (Hanahan, D.;
Folkman, J.; Cell. 1996, 86:353; Folkman, J. Endocrinology. 1998, 139:441). Several assays have been
developed to measure angiogenesis in vitro and in vivo, such as the tissue culture assays endothelial cell tube
formation and endothelial cell proliferation. Using these assays as well as in vitro neo-vascularization, the role of
101P3A11 in angiogenesis, enhancement or inhibition, is confirmed.

For example, endothelial cells engineered to express 101P3A11 are evaluated using tube formation and
proliferation assays. The effect of 101P3A11 is also confirmed in animal models in vivo. For example, cells either
expressing or lacking 101P3A11 are implanted subcutaneously in immunocompromised mice. Endothelial cell
migration and angiogenesis are evaluated 5-15 days later using immunohistochemistry techniques. 101P3A11
affects angiogenesis, and it is used as a target for diagnostic, prognostic, preventative and/or therapeutic purposes.


Example 44: Regulation of Transcription 

The cell surface localization of 101P3A11 and its similarity to GPCRs indicate that 101P3A11 is
effectively used as a modulator of the transcriptional regulation of eukaryotic genes. Regulation of gene expression
is confirmed, e.g., by studying gene expression in cells expressing or lacking 101P3A11. For this purpose, two
types of experiments are performed.
In the first set of experiments, RNA from parental and 101P3A11-expressing cells is extracted and
hybridized to commercially available gene arrays (Clontech) (Smid-Koopman E et al. Br J Cancer. 2000. 83:246).
Resting cells as well as cells treated with FBS or androgen are compared. Differentially expressed genes are
identified in accordance with procedures known in the art. The differentially expressed genes are then mapped to
biological pathways (Chen K et al. Thyroid. 2001. 11:41).
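
One common way to flag differentially expressed genes from two array hybridizations is a fold-change
threshold. The sketch below illustrates that idea only; the source refers generally to "procedures known in the
art", so the two-fold cutoff, the function name, and the gene labels used with it are assumptions made for
illustration.

```python
import math


def differentially_expressed(parental, expressing, min_fold=2.0):
    """Return {gene: log2 fold change} for genes changed at least min_fold.

    parental, expressing: dicts mapping a gene label to a positive array
    signal from parental and 101P3A11-expressing cells, respectively.
    Genes missing from either dict, or with non-positive signal, are skipped.
    """
    threshold = math.log2(min_fold)
    hits = {}
    for gene in parental.keys() & expressing.keys():
        p, e = parental[gene], expressing[gene]
        if p <= 0 or e <= 0:
            continue
        log2_fc = math.log2(e / p)
        if abs(log2_fc) >= threshold:
            hits[gene] = log2_fc
    return hits
```

Positive values indicate higher signal in the 101P3A11-expressing cells and negative values indicate lower
signal. The resulting gene list is what would then be mapped to biological pathways.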

In the second set of experiments, specific transcriptional pathway activation is evaluated using
commercially available (Stratagene) luciferase reporter constructs including: NFkB-luc, SRE-luc, ELK1-luc,
ARE-luc, p53-luc, and CRE-luc. These transcriptional reporters contain consensus binding sites for known transcription
factors that lie downstream of well-characterized signal transduction pathways, and represent a good tool to
ascertain pathway activation and screen for positive and negative modulators of pathway activation.

Thus, 101P3A11 plays a role in gene regulation, and it is used as a target for diagnostic, prognostic,
preventative and/or therapeutic purposes.


Example 45: Involvement in Cell Adhesion 

Cell adhesion plays a critical role in tissue colonization and metastasis. 101P3A11 can participate in
cellular organization, and as a consequence cell adhesion and motility. To confirm that 101P3A11 regulates cell
adhesion, control cells lacking 101P3A11 are compared to cells expressing 101P3A11, using techniques previously
described (see, e.g., Haier et al, Br. J. Cancer. 1999, 80:1867; Lehr and Pienta, J. Natl. Cancer Inst. 1998,
90:118). Briefly, in one embodiment, cells labeled with a fluorescent indicator, such as calcein, are incubated on
tissue culture wells coated with media alone or with matrix proteins. Adherent cells are detected by fluorimetric
analysis and percent adhesion is calculated. In another embodiment, cells lacking or expressing 101P3A11 are
analyzed for their ability to mediate cell-cell adhesion using similar experimental techniques as described above.
Both of these experimental systems are used to identify proteins, antibodies and/or small molecules that modulate
cell adhesion to extracellular matrix and cell-cell interaction. Cell adhesion plays a critical role in tumor growth,
progression, and colonization, and 101P3A11 is involved in these processes. Thus, it serves as a diagnostic,
prognostic, preventative and/or therapeutic modality.
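
The comparison of matrix-coated wells with wells coated with media alone can be expressed as a difference of
two adhesion percentages. This is an illustrative sketch only: the source states that percent adhesion is
calculated but gives no formula, so the subtraction of the media-alone well and all names and values here are
assumptions.

```python
def specific_adhesion(matrix_well, media_well, input_signal):
    """Percent adhesion to matrix minus percent adhesion to media-alone wells.

    matrix_well, media_well: fluorescence of adherent cells after washing.
    input_signal: fluorescence of the labeled cells originally added per well.
    """
    if input_signal <= 0:
        raise ValueError("input fluorescence must be positive")
    percent_matrix = 100.0 * matrix_well / input_signal
    percent_media = 100.0 * media_well / input_signal
    return percent_matrix - percent_media
```

The same calculation, run for control cells and for cells expressing 101P3A11, gives the two values that are
compared in the assay.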


Example 46: Protein-Protein Association 

Several GPCRs have been shown to interact with other proteins, thereby regulating signal transduction,
gene transcription, transformation and cell adhesion (Sexton PM et al, Cell Signal. 2001, 13:73; Turner CE, J Cell
Sci. 2000, 23:4139). Using immunoprecipitation techniques as well as yeast two-hybrid systems, proteins are
identified that associate with 101P3A11. Immunoprecipitates from cells expressing 101P3A11 and cells lacking
101P3A11 are compared for specific protein-protein associations.

Studies are performed to confirm the extent of association of 101P3A11 with effector molecules, such as
receptors, adaptor proteins and paxillin, kinases, phosphatases and Gα proteins. Studies comparing
101P3A11-positive and 101P3A11-negative cells, as well as studies comparing unstimulated/resting cells and cells
treated with epithelial cell activators, such as cytokines, growth factors, androgen and anti-integrin Ab, reveal unique
interactions.

In addition, protein-protein interactions are confirmed using yeast two-hybrid methodology (Curr Opin
Chem Biol. 1999, 3:64). A vector carrying a library of proteins fused to the activation domain of a transcription
factor is introduced into yeast expressing a 101P3A11-DNA-binding domain fusion protein and a reporter construct.
Protein-protein interaction is detected by colorimetric reporter activity. Specific association with effector molecules
and transcription factors directs one of skill to the mode of action of 101P3A11, and thus identifies therapeutic,
prognostic, preventative and/or diagnostic targets for cancer. This and similar assays are also used to identify and
screen for small molecules that interact with 101P3A11.

Thus it is found that 101P3A11 associates with proteins and small molecules. Accordingly, 101P3A11 and
these proteins and small molecules are used for diagnostic, prognostic, preventative and/or therapeutic purposes.

Example 47: Biological Effect of Anti-101P3A11 Antibodies

The generation of anti-101P3A11 polyclonal Ab (pAb) using an amino-terminal peptide encoding amino
acids 1-14 (MVDPNGNESSATYF; SEQ ID NO:2880) as antigen was reported in our Priority Application. The
effect of this antibody on 101P3A11-mediated ERK phosphorylation (Figure 38) and cAMP accumulation (Figure
39) was determined. 293T cells were transfected with control or 101P3A11 cDNA. Cells were allowed to rest
overnight, and treated with anti-101P3A11 or control Ab in the presence of 0.5% or 10% FBS. Cells were lysed
and analyzed by Western blotting with anti-Phospho-ERK and anti-ERK mAb. Figure 38 shows that expression of
101P3A11 induces ERK phosphorylation in cells treated with 0.5% or 10% FBS. Anti-101P3A11 pAb reduced the
phosphorylation of ERK in 293T-101P3A11 cells treated with 0.5% FBS. The ERK overlay demonstrated equal
loading, supporting the specificity of these data.

In order to confirm that anti-101P3A11 pAb has a detectable effect on cAMP accumulation, PC3 and
PC3-101P3A11 cells were grown in 0.1% FBS and treated with anti-101P3A11 pAb. Cells were analyzed for cAMP
content as described in Figure 32. Expression of 101P3A11 induced a 5-fold increase in cAMP accumulation in
PC3 cells, which was partially inhibited by PTX. Treatment of PC3-101P3A11 cells with anti-101P3A11 pAb
resulted in a 4-fold increase in cAMP accumulation in PC3-101P3A11 but not control PC3 cells. Results shown in
Figure 38 and Figure 39 indicate that anti-101P3A11 pAb produces a measurable biological effect in cells
expressing 101P3A11. Accordingly, 101P3A11 is used as a target for diagnostic, prognostic, preventative and/or
therapeutic purposes.


Example 48: 101P3A11 Promoters 

A eukaryotic cell promoter is a short DNA sequence located in the 5' region of a gene. It provides
binding sites for RNA polymerase and associated transcriptional cofactors, which upon assembly promote
transcription of the gene. In humans, most genes are transcribed by RNA polymerase II. The promoter DNA
sequence normally contains binding motifs for RNA polymerase and its associated cofactors and activators,
including a TATA-box, cap signal, CCAAT-box and the GC-box. A eukaryotic cell enhancer is a DNA sequence
where transcriptional factors and their associated coactivators or suppressors bind and interact with promoter-bound
RNA polymerase to regulate the expression of the gene located next to the promoter. While a promoter is located
close to the transcription starting site(s) of a gene (usually 25-30 base pairs), enhancers can be found up to 50,000
base pairs in either direction from the transcription starting site(s) of the gene. There are many different gene
regulatory proteins, namely transcription factors and their associated coactivators and cosuppressors, that bind to
specific enhancer sequences and regulate gene expression. These proteins, upon interaction with specific DNA
regulatory sequences and with each other, allow each gene to be regulated up or down in different tissues and cell
types (Chapter 9, "Control of gene expression" in Molecular Biology of the Cell. 3rd ed. Ed. by Alberts et al., (New
York and London) Garland Publishing, 1994).
Tissue specific gene expression is associated with the presence of specific combinations of transcription
factors and their associated coactivators and suppressors, the presence of specific binding sites present in the DNA
regulatory region of the gene for these factors, and the activation or inactivation of signaling pathways that
modulate their relative activity. For example, prostate specific expression of prostate specific antigen (PSA, or
human kallikrein 3) is dependent on the presence of androgen receptor binding elements in defined 5' upstream
enhancer and promoter sequences of the gene and an intact androgen receptor signaling pathway (Pang S, et al.,
Cancer Res 1997 Feb 1; 57(3):495-9). It is also dependent on the presence of other cis-acting DNA regulatory
elements in the promoter region (Zhang J, et al., Nucleic Acids Res. 1997, Aug. 1; 25(15):3143-50) and on the
expression of other transcription factors, such as the prostate specific ets-like transcription factor (Oettgen P, et al., J
Biol Chem. 2000, Jan. 14; 275(2):1216-25).

With the accumulation of data and knowledge on human gene expression, promoters and enhancers are
identified using different algorithms and computer programs (Chapter 8 "Gene Prediction" in Bioinformatics -
Sequence and Genome Analysis, ed. by David W. Mount. Cold Spring Harbor Laboratory Press, 2001).
Accordingly, we identified (Table XXX) promoters in a 5.04 kB 5' upstream genomic region of the 101P3A11
coding sequence using the Neural Network Promoter Prediction computer program
(http://www.fruitfly.org/seq_tools/nnppAbst.html; Reese, M.G. and Eeckman, F.H. (1995) Novel Neural Network
Algorithms for Improved Eukaryotic Promoter Site Recognition. Accepted talk for The Seventh International
Genome Sequencing and Analysis Conference, Hyatt Regency, Hilton Head Island, South Carolina, September
16-20, 1995), indicated by the underlined sequences in Table XXIX. Using a program called SIGNAL SCAN
(http://bimas.dcrt.nih.gov/molbio/signal/; Prestridge, D.S. (1991) SIGNAL SCAN: A computer program that scans
DNA sequences for eukaryotic transcriptional elements. CABIOS 7, 203-206), which searches a comprehensive
database of regulatory element binding sites, we found numerous transcriptional binding sites for various known
transcription factors in the 5.04 kB sequence 5' to the 101P3A11 gene, suggesting the presence of specific enhancer
regions that may mediate tissue specific 101P3A11 transcription. Such transcription factors include, but are not
limited to, NFAT, NF-1, NF-E, CP2, AP-1, AP-2, Sp1, OCT-1, OCT-2, NFkB, CREB, CTF, TFIIA, TFIIB, TFIID,
Pit-1, C/EBP, SRF, and various steroid receptors, such as glucocorticoid receptor (GR) and androgen receptors (AR)
(Mitchell P J and Tjian R (1989) Science 245: 371). Upon comparison of the 5 kB upstream sequence of the 101P3A11
gene to the 5 kB upstream sequence of the PSA gene, 5 homologous regions were found that are important for
prostate cell specific expression. Figure XXXI shows the alignment of these sequences and also indicates
predicted transcription factor binding sites common to both sequences identified using SIGNAL SCAN.
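
To illustrate the general idea of scanning an upstream sequence for core promoter elements, the following
Python sketch searches the forward strand for three widely used consensus motifs. It is not the Neural Network
Promoter Prediction program or SIGNAL SCAN, and it does not use their databases. The consensus patterns chosen
here (TATA(A/T)A(A/T), CCAAT and GGGCGG), the function name, and the test sequence in the usage note are
assumptions made for illustration.

```python
import re

# Simplified consensus patterns for three core promoter elements
# (illustrative only; real tools use curated matrices and databases).
MOTIFS = {
    "TATA-box": "TATA[AT]A[AT]",
    "CCAAT-box": "CCAAT",
    "GC-box": "GGGCGG",
}


def scan_promoter_elements(sequence):
    """Return sorted (position, element, matched text) tuples.

    Positions are 0-based offsets on the forward strand. A lookahead is
    used so that overlapping occurrences are all reported.
    """
    sequence = sequence.upper()
    hits = []
    for name, pattern in MOTIFS.items():
        for match in re.finditer(f"(?=({pattern}))", sequence):
            hits.append((match.start(), name, match.group(1)))
    return sorted(hits)
```

For the made-up sequence `GGCCAATCTGGGCGGAGTATAAAAGG`, the function reports a CCAAT-box at offset 2, a GC-box
at offset 9 and a TATA-box at offset 17. A fuller treatment would also scan the reverse complement and weight
each hit by its distance from the transcription start site.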

Experimentally, one defines the regulatory regions in the 5' genomic upstream sequence of the 101P3A11 gene using
various methods well known in the art, such as deletion and mutation analysis of the putative regulatory regions
fused to a transcriptional reporter gene such as luciferase or chloramphenicol acetyl-transferase. These
transcriptional reporter vectors are then introduced into cell lines, tissues, or transgenic animals to analyze the tissue
and cell type specificity of transcription and expression of the reporter gene. To identify transcription factors and
proteins that interact with specific 101P3A11 transcriptional regulatory sequences, one employs one or more of
various techniques known in the art such as DNAse footprinting, gel mobility shift assays, and DNA/protein affinity
chromatography. Various techniques concerning use of promoters are set forth, e.g., in U.S. Patent 5,919,652, which
concerns embodiments of nucleic acid compositions that comprise a prostate specific antigen (PSA) promoter alone
or in combination with a cytomegalovirus (CMV) promoter and related uses; and in U.S. Patent 6,110,702, which
concerns PSA positive regulatory sequence (PSAR) and related uses.
Once regulatory sequences are identified that mediate 101P3A11 tissue-specific expression, these
sequences are employed in various gene therapeutic strategies for cancer, such as driving tissue-specific expression
of a toxic gene or a cell suicide gene. Such cell suicide strategies are currently employed using the PSA
promoter/enhancer with the thymidine kinase/ganciclovir system (Suzuki S, Tadakuma T, Asano T, Hayakawa M. Cancer
Res. 2001 Feb 15; 61(4):1276-90). Unlike PSA, which is an androgen regulated gene, 101P3A11 does not exhibit
androgen regulated expression. Thus, identification and use of regulatory sequences of the 101P3A11 gene that
mediate, e.g., prostate-specific but androgen insensitive gene expression is useful for the treatment of both early
stage androgen sensitive and late stage androgen insensitive or hormone refractory prostate cancer.

Throughout this application, various website data content, publications, patent applications and patents are 
referenced. (Websites are referenced by their Uniform Resource Locator, or URL, addresses on the World Wide 
Web.) The disclosures of each of these references are hereby incorporated by reference herein in their entireties. 

The present invention is not to be limited in scope by the embodiments disclosed herein, which are 
intended as single illustrations of individual aspects of the invention, and any that are functionally equivalent are 
within the scope of the invention. Various modifications to the models and methods of the invention, in addition to 
those described herein, will become apparent to those skilled in the art from the foregoing description and 
teachings, and are similarly intended to fall within the scope of the invention. Such modifications or other 
embodiments can be practiced without departing from the true scope and spirit of the invention. 




TABLES 


TABLE I: Exemplary Tissues that Express 101P3A11

Normal Tissues:

Prostate
Ovary (by RT-PCR only)

Malignant Tissues:

Rectum
Prostate
Colon
Kidney
Breast
Uterus
Cervix
Stomach
Metastases Pool


TABLE II: AMINO ACID ABBREVIATIONS 


SINGLE LETTER    THREE LETTER    FULL NAME

F                Phe             phenylalanine
L                Leu             leucine
S                Ser             serine
Y                Tyr             tyrosine
C                Cys             cysteine
W                Trp             tryptophan
P                Pro             proline
H                His             histidine
Q                Gln             glutamine
R                Arg             arginine
I                Ile             isoleucine
M                Met             methionine
T                Thr             threonine
N                Asn             asparagine
K                Lys             lysine
V                Val             valine
A                Ala             alanine
D                Asp             aspartic acid
E                Glu             glutamic acid
G                Gly             glycine

TABLE III: AMINO ACID SUBSTITUTION MATRIX


Adapted from the GCG Software 9.0 BLOSUM62 amino acid substitution matrix (block substitution 
matrix). The higher the value, the more likely a substitution is found in related, natural proteins. (See URL 
www.ikp.unibe.ch/manual/blosum62.html ) 

   A   C   D   E   F   G   H   I   K   L   M   N   P   Q   R   S   T   V   W   Y
   4   0  -2  -1  -2   0  -2  -1  -1  -1  -1  -2  -1  -1  -1   1   0   0  -3  -2   A
       9  -3  -4  -2  -3  -3  -1  -3  -1  -1  -3  -3  -3  -3  -1  -1  -1  -2  -2   C
           6   2  -3  -1  -1  -3  -1  -4  -3   1  -1   0  -2   0  -1  -3  -4  -3   D
               5  -3  -2   0  -3   1  -3  -2   0  -1   2   0   0  -1  -2  -3  -2   E
                   6  -3  -1   0  -3   0   0  -3  -4  -3  -3  -2  -2  -1   1   3   F
                       6  -2  -4  -2  -4  -3   0  -2  -2  -2   0  -2  -3  -2  -3   G
                           8  -3  -1  -3  -2   1  -2   0   0  -1  -2  -3  -2   2   H
                               4  -3   2   1  -3  -3  -3  -3  -2  -1   3  -3  -1   I
                                   5  -2  -1   0  -1   1   2   0  -1  -2  -3  -2   K
                                       4   2  -3  -3  -2  -2  -2  -1   1  -2  -1   L
                                           5  -2  -2   0  -1  -1  -1   1  -1  -1   M
                                               6  -2   0   0   1   0  -3  -4  -2   N
                                                   7  -1  -2  -1  -1  -2  -4  -3   P
                                                       5   1   0  -1  -2  -2  -1   Q
                                                           5  -1  -1  -3  -3  -2   R
                                                               4   1  -2  -3  -2   S
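
As an illustration of how the substitution matrix of TABLE III can be read, the following minimal Python sketch looks up scores for residue pairs and orders candidate replacements from most to least favored. It is not part of the disclosure: the dictionary holds only a hand-picked subset of the scores in TABLE III, and the names BLOSUM62_SUBSET, substitution_score, and rank_substitutions are hypothetical helpers introduced here for illustration.

```python
# Scores transcribed from TABLE III; only a hand-picked subset is included,
# so this is an illustrative lookup rather than a full BLOSUM62 matrix.
BLOSUM62_SUBSET = {
    ("A", "A"): 4,
    ("A", "S"): 1,
    ("A", "W"): -3,
    ("C", "C"): 9,
    ("D", "E"): 2,
    ("D", "L"): -4,
    ("F", "Y"): 3,
    ("I", "L"): 2,
    ("I", "V"): 3,
    ("K", "R"): 2,
}


def substitution_score(a, b, table=BLOSUM62_SUBSET):
    """Return the score for replacing residue a with residue b.

    The matrix is symmetric, so the pair is looked up in either order.
    Raises KeyError if the pair is not in the subset above.
    """
    if (a, b) in table:
        return table[(a, b)]
    return table[(b, a)]


def rank_substitutions(residue, candidates, table=BLOSUM62_SUBSET):
    """Sort candidate replacement residues from most to least favored."""
    return sorted(
        candidates,
        key=lambda c: substitution_score(residue, c, table),
        reverse=True,
    )
```

For example, rank_substitutions("I", ["L", "V"]) places V ahead of L, reflecting the higher value for the I/V pair in the table.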

TABLE IV (A):

SUPERMOTIFS   POSITION 2          POSITION 3          POSITION C Terminus
              (Primary Anchor)    (Primary Anchor)    (Primary Anchor)
A1            TILVMS                                  FWY
A2            LIVMATQ                                 IVMATL
A3            VSMATLI                                 RK
A24           YFWIVLMT                                FIYWLM
B7            P                                       VILFMWYA
B27           RHK                                     FYLWMIVA
B44           ED                                      FWYLIMVA
B58           ATS                                     FWYLIVMA
B62           QLIVMP                                  FWYMIVLA

MOTIFS
A1            TSM                                     Y
A1                                DEAS                Y
A2.1          LMVQIAT                                 VLIMAT
A3            LMVISATFCGD                             KYRHFA
A11           VTMLISAGNCDF                            KRYH
A24           YFWM                                    FLIW
A*3101        MVTALIS                                 RK
A*3301        MVALFIST                                RK
A*6801        AVTMSLI                                 RK
B*0702        P                                       LMFWYAIV
B*3501        P                                       LMFWYIVA
B51           P                                       LIVFWYAM
B*5301        P                                       IMFWYALV
B*5401        P                                       ATIVLMFWY


Bolded residues are preferred; italicized residues are less preferred. A peptide is considered motif-bearing if it
has primary anchors at each primary anchor position for a motif or supermotif as specified in the above table.
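
The motif-bearing rule stated above can be sketched in Python as follows. This is a hedged illustration only: it covers three supermotif rows of TABLE IV (A) whose primary anchors sit at position 2 and the C terminus, it ignores the preferred versus less preferred distinction, and the names SUPERMOTIFS and is_motif_bearing are hypothetical.

```python
# Primary anchor residues transcribed from three supermotif rows of
# TABLE IV (A). The preferred / less preferred distinction is not modeled.
SUPERMOTIFS = {
    "A1": {"position_2": "TILVMS", "c_terminus": "FWY"},
    "B7": {"position_2": "P", "c_terminus": "VILFMWYA"},
    "B44": {"position_2": "ED", "c_terminus": "FWYLIMVA"},
}


def is_motif_bearing(peptide, motif):
    """Return True if the peptide has an allowed residue at every
    primary anchor position (position 2 and the C terminus)."""
    if len(peptide) < 2:
        return False
    # Position 2 is index 1 because peptide positions are counted from 1.
    return (
        peptide[1] in motif["position_2"]
        and peptide[-1] in motif["c_terminus"]
    )
```

Under this sketch, the peptide VLGNLTIIY carries the A1 supermotif (L at position 2, Y at the C terminus), whereas LAFPLCSLY does not, because A is not among the position 2 anchors.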


TABLE IV (B): HLA CLASS II SUPERMOTIF 


1                   6                         9
W, F, Y, V, I, L    A, V, I, L, P, C, S, T    A, V, I, L, C, S, T, M, Y

TABLE V: HLA PEPTIDE SCORING RESULTS - 101P3A11 - A1, 9-MERS

SCORE: ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE

RANK  START     SUBSEQUENCE      SCORE     SEQ. ID#
      POSITION  RESIDUE LISTING
1     210       GLDSLLISF        25.000    1
2     1         MVDPNGNES        5.000     2
3     29        LAFPLCSLY        5.000     3
4     21        GLEEAQFWL        4.500     4
5     212       DSLLISFSY        3.750     5
6     41        VLGNLTIIY        2.500     6
7     245       HVCAVFIFY        2.500     7
8     57        LHEPMYIFL        2.250     8
9     111       GMESTVLLA        2.250     9
10    6         GNESSATYF        2.250     10
11    259       LSMVHRFSK        1.500     11
12    114       STVLLAMAF        1.250     12
13    190       ACDDIRVNV        1.000     13
14    74        LISTSSMPK        1.000     14
15    158       MAPLPVFIK        1.000     15
16    78        SSMPKMLAI        0.750     16
17    172       RSNILSHSY        0.750     17
18    134       HATVLTLPR        0.500     18
19    44        NLTIIYIVR        0.500     19
20    117       LLAMAFDRY        0.500     20
21    116       VLLAMAFDR        0.500     21
22    70        GIDILISTS        0.500     22
23    231       TREAQAKAF        0.450     23
24    180       YCLHQDVMK        0.400     24
25    137       VLTLPRVTK        0.400     25
26    183       HQDVMKLAC        0.375     26
27    5         NGNESSATY        0.250     27
28    138       LTLPRVTKI        0.250     28
29    52        RTEHSLHEP        0.225     29
30    145       KIGVAAVVR        0.200     30
31    56        SLHEPMYIF        0.200     31
32    242       CVSHVCAVF        0.200     32
33    256       FIGLSMVHR        0.200     33
34    287       VLNPIVYGV        0.200     34
35    156       ALMAPLPVF        0.200     35
36    125       YVAICHPLR        0.200     36
37    248       AVFIFYVPF        0.200     37
38    273       LPVILANIY        0.125     38
39    95        QFDACLLQI        0.125     39
40    271       SPLPVILAN        0.125     40
41    11        ATYFILIGL        0.125     41
42    159       APLPVFIKQ        0.125     42
43    128       ICHPLRHAT        0.100     43
44    157       LMAPLPVFI        0.100     44
45    283       LVPPVLNPI        0.100     45
46    77        TSSMPKMLA        0.075     46
47    68        LSGIDILIS        0.075     47
48    176       LSHSYCLHQ        0.075     48
49    9         SSATYFILI        0.075     49
50    8         ESSATYFIL        0.075     50
51    218       FSYLLILKT        0.075     51
52    216       ISFSYLLIL        0.075     52
53    90        NSTTIQFDA        0.075     53
54    55        HSLHEPMYI        0.075     54
55    270       DSPLPVILA        0.075     55
56    285       PPVLNPIVY        0.062     56
57    233       EAQAKAFGT        0.050     57
58    203       IVIISAIGL        0.050     58
59    188       KLACDDIRV        0.050     59
60    97        DACLLQIFA        0.050     60
61    19        LPGLEEAQF        0.050     61
62    253       YVPFIGLSM        0.050     62
63    98        ACLLQIFAI        0.050     63
64    164       FIKQLPFCR        0.050     64
65    33        LCSLYLIAV        0.050     65
66    288       LNPIVYGVK        0.050     66
67    260       SMVHRFSKR        0.050     67
68    67        MLSGIDILI        0.050     68
69    126       VAICHPLRH        0.050     69
70    240       GTCVSHVCA        0.050     70
71    54        EHSLHEPMY        0.050     71
72    276       ILANIYLLV        0.050     72
73    215       LISFSYLLI        0.050     73
74    217       SFSYLLILK        0.050     74
75    83        MLAIFWFNS        0.050     75
76    39        IAVLGNLTI        0.050     76
77    246       VCAVFIFYV        0.050     77
78    162       PVFIKQLPF        0.050     78
79    208       AIGLDSLLI        0.050     79
80    250       FIFYVPFIG        0.050     80
81    155       AALMAPLPV        0.050     81
82    65        LCMLSGIDI        0.050     82
83    10        SATYFILIG        0.050     83
84    268       RRDSPLPVI        0.050     84
85    92        TTIQFDACL        0.050     85
86    181       CLHQDVMKL        0.050     86
87    62        YIFLCMLSG        0.050     87
88    197       NVVYGLIVI        0.050     88
89    135       ATVLTLPRV        0.050     89
90    100       LLQIFAIHS        0.050     90
91    282       LLVPPVLNP        0.050     91
92    45        LTIIYIVRT        0.050     92
93    205       IISAIGLDS        0.050     93
94    22        LEEAQFWLA        0.045     94
95    297       TKEIRQRIL        0.045     95
96    113       ESTVLLAMA        0.030     96
97    243       VSHVCAVFI        0.030     97
98    75        ISTSSMPKM        0.030     98
99    34        CSLYLIAVL        0.030     99
100   224       LKTVLGLTR        0.025     100
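
The START POSITION and SUBSEQUENCE RESIDUE LISTING columns of the scoring tables are consistent with sliding a fixed-length window along the 101P3A11 amino acid sequence, counting residues from 1. The following Python sketch illustrates that enumeration only; it does not compute the SCORE column. The 19-residue fragment is reassembled here from overlapping entries of TABLE V (start positions 1, 6, and 11) and is an assumption made for illustration, as are the names enumerate_subsequences and N_TERMINAL_FRAGMENT.

```python
def enumerate_subsequences(protein, length):
    """Yield (start_position, subsequence) pairs for every window of the
    given length, using 1-based start positions as in the tables."""
    for i in range(len(protein) - length + 1):
        yield i + 1, protein[i:i + length]


# Fragment reassembled from overlapping TABLE V entries (an assumption for
# illustration; it is not the full-length protein sequence).
N_TERMINAL_FRAGMENT = "MVDPNGNESSATYFILIGL"

nine_mers = dict(enumerate_subsequences(N_TERMINAL_FRAGMENT, 9))
ten_mers = dict(enumerate_subsequences(N_TERMINAL_FRAGMENT, 10))
```

With this fragment, nine_mers[1] is MVDPNGNES and nine_mers[11] is ATYFILIGL, matching the entries listed at start positions 1 and 11 in TABLE V.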


TABLE VI: HLA PEPTIDE SCORING RESULTS - 101P3A11 - A1, 10-MERS

SCORE: ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE

RANK  START     SUBSEQUENCE      SCORE     SEQ. ID#
      POSITION  RESIDUE LISTING
1     1         MVDPNGNESS       5.000     101
2     157       LMAPLPVFIK       5.000     102
3     78        SSMPKMLAIF       3.000     103
4     40        AVLGNLTIIY       2.500     104
5     191       CDDIRVNVVY       [illegible]  105
6     57        LHEPMYIFLC       2.250     106
7     22        LEEAQFWLAF       2.250     107
8     297       TKEIRQRILR       2.250     108
9     52        RTEHSLHEPM       2.250     109
10    111       GMESTVLLAM       2.250     110
11    287       VLNPIVYGVK       2.000     111
12    216       ISFSYLLILK       1.500     112
13    73        ILISTSSMPK       1.000     113
14    258       GLSMVHRFSK       1.000     114
15    21        GLEEAQFWLA       0.900     115
16    113       ESTVLLAMAF       0.750     116
17    243       VSHVCAVFIF       0.750     117
18    284       VPPVLNPIVY       0.625     118
19    29        LAFPLCSLYL       0.500     119
20    70        GIDILISTSS       0.500     120
21    116       VLLAMAFDRY       0.500     121
22    115       TVLLAMAFDR       0.500     122
23    210       GLDSLLISFS       0.500     123
24    190       ACDDIRVNVV       0.500     124
25    28        WLAFPLCSLY       0.500     125
26    6         GNESSATYFI       0.450     126
27    136       TVLTLPRVTK       [illegible]  127
28    55        HSLHEPMYIF       [illegible]  128
29    95        QFDACLLQIF       [illegible]  129
30    [illegible]  [illegible]  [illegible]  130
31    128       ICHPLRHATV       0.200     131
32    155       AALMAPLPVF       0.200     132
33    247       CAVFIFYVPF       0.200     133
34    283       LVPPVLNPIV       0.200     134
35    241       TCVSHVCAVF       0.200     135
36    18        GLPGLEEAQF       0.200     136
37    259       LSMVHRFSKR       0.150     137
38    43        GNLTIIYIVR       0.125     138
39    244       SHVCAVFIFY       0.125     139
40    209       IGLDSLLISF       0.125     140
41    161       LPVFIKQLPF       0.125     141
42    211       LDSLLISFSY       0.125     142
43    76        STSSMPKMLA       0.125     143
44    256       FIGLSMVHRF       0.100     144
45    282       LLVPPVLNPI       0.100     145
46    231       TREAQAKAFG       0.090     146
47    77        TSSMPKMLAI       0.075     147
48    270       DSPLPVILAN       0.075     148
49    8         ESSATYFILI       0.075     149
50    68        LSGIDILIST       0.075     150
51    9         SSATYFILIG       0.075     151
52    272       PLPVILANIY       0.050     152
53    214       LLISFSYLLI       0.050     153
54    245       HVCAVFIFYV       0.050     154
55    175       ILSHSYCLHQ       0.050     155
56    289       NPIVYGVKTK       0.050     156
57    41        VLGNLTIIYI       0.050     157
58    105       AIHSLSGMES       0.050     158
59    125       YVAICHPLRH       0.050     159
60    202       LIVIISAIGL       0.050     160
61    260       SMVHRFSKRR       0.050     161
62    97        DACLLQIFAI       0.050     162
63    56        SLHEPMYIFL       0.050     163
64    35        SLYLIAVLGN       0.050     164
65    195       RVNVVYGLIV       0.050     165
66    215       LISFSYLLIL       0.050     166
67    221       LLILKTVLGL       0.050     167
68    67        MLSGIDILIS       0.050     168
69    268       RRDSPLPVIL       0.050     169
70    271       SPLPVILANI       0.050     170
71    207       SAIGLDSLLI       0.050     171
72    102       QIFAIHSLSG       0.050     172
73    64        FLCMLSGIDI       0.050     173
74    208       AIGLDSLLIS       0.050     174
75    204       VIISAIGLDS       0.050     175
76    93        TIQFDACLLQ       0.050     176
77    275       VILANIYLLV       0.050     177
78    10        SATYFILIGL       0.050     178
79    198       VVYGLIVIIS       0.050     179
80    24        EAQFWLAFPL       0.050     180
81    154       GAALMAPLPV       0.050     181
82    91        STTIQFDACL       0.050     182
83    228       LGLTREAQAK       0.050     183
84    99        CLLQIFAIHS       0.050     184
85    38        LIAVLGNLTI       0.050     185
86    53        TEHSLHEPMY       0.050     186
87    223       ILKTVLGLTR       0.050     187
88    158       MAPLPVFIKQ       0.050     188
89    250       FIFYVPFIGL       0.050     189
90    180       YCLHQDVMKL       0.050     190
91    109       LSGMESTVLL       0.030     191
92    34        CSLYLIAVLG       0.030     192
93    225       KTVLGLTREA       0.025     193
94    5         NGNESSATYF       0.025     194
95    230       LTREAQAKAF       0.025     195
96    171       CRSNILSHSY       0.025     196
97    87        FWFNSTTIQF       0.025     197
98    17        IGLPGLEEAQ       0.025     198
99    82        KMLAIFWFNS       0.025     199
100   4         PNGNESSATY       0.025     200


TABLE VII: HLA PEPTIDE SCORING RESULTS - 101P3A11 - A-0201, 9-MERS

SCORE: ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE

RANK  START     SUBSEQUENCE      SCORE     SEQ. ID#
      POSITION  RESIDUE LISTING
1     213       SLLISFSYL        825.977   201
2     287       VLNPIVYGV        271.948   202
3     188       KLACDDIRV        243.432   203
4     28        WLAFPLCSL        226.014   204
5     246       VCAVFIFYV        215.192   205
6     21        GLEEAQFWL        178.815   206
7     276       ILANIYLLV        177.358   207
8     220       YLLILKTVL        149.071   208
9     214       LLISFSYLL        138.001   209
10    14        FILIGLPGL        114.985   210
11    157       LMAPLPVFI        70.450    211
12    279       NIYLLVPPV        70.387    212
13    108       SLSGMESTV        69.552    213
14    82        KMLAIFWFN        54.625    214
15    181       CLHQDVMKL        49.134    215
16    25        AQFWLAFPL        46.480    216
17    275       VILANIYLL        42.494    217
18    222       LILKTVLGL        42.494    218
19    37        YLIAVLGNL        29.382    219
20    66        CMLSGIDIL        26.377    220
21    118       LAMAFDRYV        25.398    221
22    201       GLIVIISAI        23.995    222
23    306       RLFHVATHA        18.382    223
24    67        MLSGIDILI        17.736    224
25    85        AIFWFNSTT        14.407    225
26    49        YIVRTEHSL        13.512    226
27    174       NILSHSYCL        10.868    227
28    167       QLPFCRSNI        10.433    228
29    60        PMYIFLCML        9.493     229
30    101       LQIFAIHSL        8.469     230
31    254       VPFIGLSMV        6.568     231
32    120       MAFDRYVAI        5.605     232
33    83        MLAIFWFNS        4.747     233
34    198       VVYGLIVII        4.683     234
35    160       PLPVFIKQL        [illegible]  235
36    304       ILRLFHVAT        [illegible]  236
37    215       LISFSYLLI        3.658     237
38    155       AALMAPLPV        3.574     238
39    283       LVPPVLNPI        3.569     239
40    98        ACLLQIFAI        3.361     240
41    40        AVLGNLTII        3.185     241
42    203       IVIISAIGL        3.178     242
43    93        TIQFDACLL        2.937     243
44    31        FPLCSLYLI        2.666     244
45    135       ATVLTLPRV        2.222     245
46    11        ATYFILIGL        2.184     246
47    110       SGMESTVLL        2.115     247
48    138       LTLPRVTKI        2.096     248
49    253       YVPFIGLSM        2.000     249
50    303       RILRLFHVA        1.969     250
51    33        LCSLYLIAV        1.775     251
52    38        LIAVLGNLT        1.742     252
53    218       FSYLLILKT        1.647     253
54    43        GNLTIIYIV        1.584     254
55    119       AMAFDRYVA        1.471     255
56    42        LGNLTIIYI        1.465     256
57    208       AIGLDSLLI        1.435     257
58    200       YGLIVIISA        1.270     258
59    281       YLLVPPVLN        1.268     259
60    92        TTIQFDACL        1.127     260
61    7         NESSATYFI        1.116     261
62    241       TCVSHVCAV        1.044     262
63    197       NVVYGLIVI        0.861     263
64    216       ISFSYLLIL        0.827     264
65    190       ACDDIRVNV        0.745     265
66    104       FAIHSLSGM        0.730     266
67    243       VSHVCAVFI        0.637     267
68    78        SSMPKMLAI        0.580     268
69    116       VLLAMAFDR        0.544     269
70    111       GMESTVLLA        0.528     270
71    272       PLPVILANI        0.528     271
72    34        CSLYLIAVL        0.487     272
73    100       LLQIFAIHS        0.481     273
74    69        SGIDILIST        0.459     274
75    142       RVTKIGVAA        0.435     275
76    150       AVVRGAALM        0.433     276
77    [illegible]  [illegible]  [illegible]  277
78    250       FIFYVPFIG        0.415     278
79    45        LTIIYIVRT        0.405     279
80    62        YIFLCMLSG        0.401     280
81    112       MESTVLLAM        0.378     281
82    168       LPFCRSNIL        0.360     282
83    84        LAIFWFNST        0.334     283
84    226       TVLGLTREA        0.322     284
85    153       RGAALMAPL        0.321     285
86    284       VPPVLNPIV        0.316     286
87    196       VNVVYGLIV        0.316     287
88    127       AICHPLRHA        0.314     288
89    149       AAVVRGAAL        0.297     289
90    207       SAIGLDSLL        0.297     290
91    76        STSSMPKML        0.297     291
92    56        SLHEPMYIF        0.288     292
93    206       ISAIGLDSL        0.267     293
94    156       ALMAPLPVF        0.260     294
95    39        IAVLGNLTI        0.246     295
96    35        SLYLIAVLG        0.238     296
97    17        IGLPGLEEA        0.230     297
98    274       PVILANIYL        0.226     298
99    186       VMKLACDDI        0.220     299
100   99        CLLQIFAIH        0.215     300


TABLE VIII: HLA PEPTIDE SCORING RESULTS - 101P3A11 - A-0201, 10-MERS

SCORE: ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE

RANK  START     SUBSEQUENCE      SCORE     SEQ. ID#
      POSITION  RESIDUE LISTING
1     56        SLHEPMYIFL       722.583   301
2     117       LLAMAFDRYV       494.237   302
3     213       SLLISFSYLL       300.355   303
4     41        VLGNLTIIYI       224.357   304
5     156       ALMAPLPVFI       212.307   305
6     250       FIFYVPFIGL       94.987    306
7     275       VILANIYLLV       90.231    307
8     221       LLILKTVLGL       83.527    308
9     100       LLQIFAIHSL       83.527    309
10    139       TLPRVTKIGV       69.552    310
11    253       YVPFIGLSMV       64.388    311
12    94        IQFDACLLQI       62.741    312
13    245       HVCAVFIFYV       57.690    313
14    62        YIFLCMLSGI       56.155    314
15    64        FLCMLSGIDI       47.991    315
16    248       AVFIFYVPFI       42.727    316
17    137       VLTLPRVTKI       40.792    317
18    282       LLVPPVLNPI       40.792    318
19    37        YLIAVLGNLT       34.279    319
20    237       KAFGTCVSHV       28.772    320
21    66        CMLSGIDILI       27.879    321
22    234       AQAKAFGTCV       26.797    322
23    214       LLISFSYLLI       26.604    323
24    82        KMLAIFWFNS       26.114    324
25    83        MLAIFWFNST       24.070    325
26    21        GLEEAQFWLA       18.576    326
27    44        NLTIIYIVRT       17.140    327
28    218       FSYLLILKTV       15.371    328
29    303       RILRLFHVAT       14.407    329
30    142       RVTKIGVAAV       13.997    330
31    166       KQLPFCRSNI       13.698    331
32    181       CLHQDVMKLA       11.426    332
33    119       AMAFDRYVAI       11.302    333
34    29        LAFPLCSLYL       10.264    334
35    108       SLSGMESTVL       8.759     335
36    167       QLPFCRSNIL       8.759     336
37    227       VLGLTREAQA       8.446     337
38    301       RQRILRLFHV       7.149     338
39    189       LACDDIRVNV       6.733     339
40    205       IISAIGLDSL       5.628     340
41    180       YCLHQDVMKL       5.459     341
42    85        AIFWFNSTTI       5.308     342
43    242       CVSHVCAVFI       5.021     343
44    202       LIVIISAIGL       4.993     344
45    229       GLTREAQAKA       4.968     345
46    291       IVYGVKTKEI       4.966     346
47    215       LISFSYLLIL       4.709     347
48    159       APLPVFIKQL       4.510     348
49    283       LVPPVLNPIV       4.242     349
50    279       NIYLLVPPVL       3.854     350
51    25        AQFWLAFPLC       3.541     351
52    32        PLCSLYLIAV       3.519     352
53    298       KEIRQRILRL       3.344     353
54    74        LISTSSMPKM       2.671     354
55    200       YGLIVIISAI       2.666     355
56    195       RVNVVYGLIV       2.495     356
57    38        LIAVLGNLTI       2.439     357
58    240       GTCVSHVCAV       2.222     358
59    222       LILKTVLGLT       1.927     359
60    120       MAFDRYVAIC       1.678     360
61    20        PGLEEAQFWL       1.485     361
62    7         NESSATYFIL       1.482     362
63    16        LIGLPGLEEA       1.309     363
64    273       LPVILANIYL       1.304     364
65    220       YLLILKTVLG       1.268     365
66    286       PVLNPIVYGV       1.139     366
67    110       SGMESTVLLA       1.132     367
68    91        STTIQFDACL       1.127     368
69    99        CLLQIFAIHS       1.048     369
70    278       ANIYLLVPPV       1.044     370
71    127       AICHPLRHAT       1.025     371
72    154       GAALMAPLPV       0.966     372
73    89        FNSTTIQFDA       0.865     373
74    197       NVVYGLIVII       0.861     374
75    232       REAQAKAFGT       0.840     375
76    128       ICHPLRHATV       0.772     376
77    42        LGNLTIIYIV       0.728     377
78    190       ACDDIRVNVV       0.702     378
79    10        SATYFILIGL       0.682     379
80    84        LAIFWFNSTT       0.669     380
81    187       MKLACDDIRV       0.608     381
82    281       YLLVPPVLNP       0.583     382
83    271       SPLPVILANI       0.580     383
84    35        SLYLIAVLGN       0.548     384
85    111       GMESTVLLAM       0.528     385
86    148       VAAVVRGAAL       0.504     386
87    162       PVFIKQLPFC       0.448     387
88    116       VLLAMAFDRY       0.436     388
89    150       AVVRGAALMA       0.435     389
90    65        LCMLSGIDIL       0.405     390
91    212       DSLLISFSYL       0.404     391
92    112       MESTVLLAMA       0.378     392
93    210       GLDSLLISFS       0.377     393
94    28        WLAFPLCSLY       0.343     394
95    31        FPLCSLYLIA       0.339     395
96    59        EPMYIFLCML       0.338     396
97    92        TTIQFDACLL       0.297     397
98    79        SMPKMLAIFW       0.296     398
99    306       RLFHVATHAS       0.276     399
100   109       LSGMESTVLL       0.237     400


TABLE IX: HLA PEPTIDE SCORING RESULTS - 101P3A11 - A3, 9-MERS

SCORE: ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE

RANK  START     SUBSEQUENCE      SCORE     SEQ. ID#
      POSITION  RESIDUE LISTING
1     229       GLTREAQAK        60.000    401
2     137       VLTLPRVTK        30.000    402
3     56        SLHEPMYIF        20.250    403
4     260       SMVHRFSKR        18.000    404
5     210       GLDSLLISF        18.000    405
6     116       VLLAMAFDR        18.000    406
7     44        NLTIIYIVR        12.000    407
8     117       LLAMAFDRY        12.000    408
9     21        GLEEAQFWL        8.100     409
10    214       LLISFSYLL        8.100     410
11    41        VLGNLTIIY        8.000     411
12    156       ALMAPLPVF        6.750     412
13    201       GLIVIISAI        6.075     413
14    287       VLNPIVYGV        4.050     414
15    74        LISTSSMPK        4.000     415
16    181       CLHQDVMKL        3.600     416
17    245       HVCAVFIFY        3.600     417
18    111       GMESTVLLA        3.600     418
19    79        SMPKMLAIF        3.000     419
20    248       AVFIFYVPF        3.000     420
21    158       MAPLPVFIK        2.700     421
22    213       SLLISFSYL        2.700     422
23    67        MLSGIDILI        2.700     423
24    306       RLFHVATHA        1.500     424
25    60        PMYIFLCML        1.350     425
26    28        WLAFPLCSL        1.350     426
27    99        CLLQIFAIH        1.350     427
28    66        CMLSGIDIL        1.350     428
29    82        KMLAIFWFN        1.215     429
30    145       KIGVAAVVR        1.200     430
31    188       KLACDDIRV        1.200     431
32    164       FIKQLPFCR        1.200     432
33    220       YLLILKTVL        0.900     433
34    157       LMAPLPVFI        0.900     434
35    11        ATYFILIGL        0.675     435
36    198       VVYGLIVII        0.675     436
37    37        YLIAVLGNL        0.608     437
38    167       QLPFCRSNI        0.600     438
39    256       FIGLSMVHR        0.600     439
40    276       ILANIYLLV        0.600     440
41    119       AMAFDRYVA        0.600     441
42    186       VMKLACDDI        0.600     442
43    261       MVHRFSKRR        0.600     443
44    222       LILKTVLGL        0.540     444
45    259       LSMVHRFSK        0.450     445
46    290       PIVYGVKTK        0.450     446
47    282       LLVPPVLNP        0.405     447
48    275       VILANIYLL        0.405     448
49    125       YVAICHPLR        0.400     449
50    83        MLAIFWFNS        0.360     450
51    304       ILRLFHVAT        0.300     451
52    29        LAFPLCSLY        0.300     452
53    180       YCLHQDVMK        0.300     453
54    108       SLSGMESTV        0.300     454
55    242       CVSHVCAVF        0.300     455
56    279       NIYLLVPPV        0.300     456
57    114       STVLLAMAF        0.300     457
58    174       NILSHSYCL        0.270     458
59    25        AQFWLAFPL        0.270     459
60    197       NVVYGLIVI        0.270     460
61    100       LLQIFAIHS        0.240     461
62    296       KTKEIRQRI        0.203     462
63    138       LTLPRVTKI        0.203     463
64    40        AVLGNLTII        0.203     464
65    283       LVPPVLNPI        0.203     465
66    175       ILSHSYCLH        0.200     466
67    162       PVFIKQLPF        0.200     467
68    203       IVIISAIGL        0.180     468
69    85        AIFWFNSTT        0.150     469
70    35        SLYLIAVLG        0.150     470
71    47        IIYIVRTEH        0.150     471
72    14        FILIGLPGL        0.135     472
73    92        TTIQFDACL        0.135     473
74    272       PLPVILANI        0.135     474
75    216       ISFSYLLIL        0.135     475
76    160       PLPVFIKQL        0.135     476
77    208       AIGLDSLLI        0.120     477
78    215       LISFSYLLI        0.120     478
79    298       KEIRQRILR        0.108     479
80    18        GLPGLEEAQ        0.090     480
81    49        YIVRTEHSL        0.090     481
82    303       RILRLFHVA        0.090     482
83    134       HATVLTLPR        0.080     483
84    223       ILKTVLGLT        0.068     484
85    217       SFSYLLILK        0.060     485
86    253       YVPFIGLSM        0.060     486
87    273       LPVILANIY        0.060     487
88    93        TIQFDACLL        0.060     488
89    151       VVRGAALMA        0.060     489
90    9         SSATYFILI        0.054     490
91    299       EIRQRILRL        0.054     491
92    237       KAFGTCVSH        0.045     492
93    281       YLLVPPVLN        0.045     493
94    250       FIFYVPFIG        0.045     494
95    98        ACLLQIFAI        0.041     495
96    212       DSLLISFSY        0.041     496
97    31        FPLCSLYLI        0.041     497
98    101       LQIFAIHSL        0.041     498
99    32        PLCSLYLIA        0.040     499
100   45        LTIIYIVRT        0.034     500


TABLE X: 

HLA PEPTIDE SCORING RESULTS -101P3A11- A3, 10-MERS 

RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID# 

1 

157 

LMAPLPVFIK 

405.000 

501. 

2 

258 

GLSMVHRFSK 

180.000 

502. 

3 

73 

ILISTSSMPK 

60.000 

503. 

4 

116 

VLLAMAFDRY 

18.000 

504. 

5 

287 

VLNPIVYGVK 

13.500 

505. 

6 

18 

GLPGLEEAQF 

9.000 

506. 

7 

260 

SMVHRFSKRR 

9.000 

507. 
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TABLE X: 




HLA PEPTIDE SCORING RESULTS -101P3A11- A3, 10-MERS 


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID# 

8 

213 

SLLISFSYLL 

8.100 

508. 

9 

21 

GLEEAQFWLA 

8.100 

509. 

10 

223 

TT T/TT TT /~~* T ' 1 ' 1 > 

ILKTVLGLTR 

8.000 

510. 

1 1 

221 

T T TX TT" T'T TT 

LLILKTVLGL 

5.400 

511. 

12 

136 

TVLTLPRVTK 

4.500 

512. 

13 

1 o 

28 

TT TT A X^TiX ^"~iOT T T 

WLAFPLCSLY 

4.000 

513. 

1 A 

14 

186 

VMKLACDDIR 

4.000 

514. 

15 

282 

T T T n\T\t TT "V TT\T 

LLVPPVLNPI 

3.038 

515. 

16 

66 

CMLSGIDILI 

2.700 

516. 

17 

248 

AVFIFYVPFI 

2.700 

517. 

18 

111 

GMESTVLLAM 

2.700 

518. 

19 

250 

FIFYVPFIGL 

2.700 

519. 

20 

115 

TVLLAMAFDR 

1.800 

520. 

21 

137 

VLTLPRVTKI 

1.800 

521. 

22 

40 

a x rx •v tx t>ttt t 

AVLGNLTIIY 

1 .800 

522. 

23 

56 

SLHEPMYIFL 

1.800 

523. 

24 

214 

LLISFSYLLI 

1.800 

524. 

25 

41 

VLGNLTIIYI 

1.800 

525. 

26 

82 

KMLAIFWFNS 

1.620 

526. 

27 

216 

ISFSYLLILK 

1.500 

527. 

28 

64 

FLCMLSGEDI 

1.200 

528. 

29 

100 

LLQIFAIHSL 

0.900 

529. 

30 

108 

SLSGMESTVL 

0.900 

530. 

31 

156 

ALMAPLPVFI 

0.900 

531. 

32 

83 

MLAIFWFNST 

0.900 

532. 

33 

256 

FIGLSMVHRF 

0.600 

533. 

34 

304 

ILRLFHVATH 

0.600 

534. 

35 

119 

AMAFDRYVAI 

0.600 

535. 

36 

167 

QLPFCRSNIL 

0.600 

536. 

37 

35 

SLYLIAVLGN 

0.600 

537. 

38 

229 

GLTREAQAKA 

0.600 

538. 

39 

215 

LISFSYLLIL 

0.540 

539. 

40 

44 

NLTIIYIVRT 

0.450 

540. 

41 

279 

NIYLLVPPVL 

0.450 

541. 

A 'I 

42 

62 

YIFLCMLSGI 

0.450 

542. 

43 

289 

NPIVYGVKTK 

0.450 

543. 

44 

139 

TLPRVTKJGV 

0.400 

544. 

45 

79 

SMPKMLAIFW 

0.400 

545. 

46 

272 

T\T TAT 7TT A "V TT\ T 

PLPVILANIY 

0.400 

546. 

47 

131 

PLRHATVLTL 

0.360 

547. 

48 

99 

CLLQIFAIHS 

0.360 

548. 

49 

85 

AIFWFNSTTI 

0.300 

549. 



ll^r UAtLL^l 

0.270 

550. 

51 

281 

YLLVPPVLNP 

0.270 

551. 

52 

198 

VVYGLIVIIS 

0.270 

552. 

53 

245 

HVCAVFIFYV 

0.270 

553. 

54 

193 

DIRVNVVYGL 

0.243 

554. 

55 

181 

CLHQDVMKLA 

0.225 

555. 

56 

291 

IVYGVKTKEI 

0.225 

556. 
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TABLE X: 




HLA PEPTIDE SCORING RESULTS -101P3A11- A3, 10-MERS 


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LUSTING 

SCORE (ESTIMATE OF HALF TIME OF 

T\Tfi A O O /"A T A TPT/~VM yvTT 1 A Tfc f /T¥ m/^wTT 

DISASSOCIATION OF A MOLECULE 

/^/"VTVTT' A T1VTT"M/~ > ' T'T T TO O T TT> O "I? ^T T TT7 1 "fc.T rf^ T7 \ 

CONIAININCj I IlIS SUBSEQUENCE) 

SEQ.ID# 

J 1 

306 

T> T CTTI J A T'T T A O 

KLr H V ATH AS 

U.20U 

557. 

JO 

22 / 

T.7T /^T TT> TT A f~\ A 

VLOL 1 Rh AQA 

A 1AA 

558. 

C A 

jy 

32 

T")T /"lOT T/T TAT/ 

rLCSLYLIAV 

A 1 OA 

0.180 

559. 

60 

294 

Z" 1 "! 7T/'TT/"Pm /~\T* 

GVKTKEERQR 

A 1 OA 

0.180 

560. 

C 1 

61 

210 

S~* T T"\OT T TPPC 

OLDSLLISFS 

A 1 OA 

0.180 

561. 

62 

202 

LIVTISAIGL 

A 1 OA 

0.180 

562. 

63 

67 

Tk IT c/^TTMT TO 

MLSGIDILIS 

A 1 OA 

0.180 

563. 

C A 

64 

276 

TT A XTT\/T T 1 7T» 

ILANIYLLVP 

A 1 OA 

0.180 

564. 

65 

37 

"V7T T A A 7T /^\TT T' 

YLIAVLGNLT 

0.150 

565. I 

66 

15 

TT T/T T»/~< T T"*T" > 

LLIGLPGLEE 

0.135 

566. 

67 

197 

"ktt r\ r\r/~*r ttttt 

NVVYGLIVIl 

A 1 "i C 

0.135 

567. 

68 

237 

~W A T , /^''T , /~^1 70TTT T 

KAFGTCVSHV 

A 1 T C 

0.135 

568. 

4CA 

69 

OA 1 

201 

/~> y TT 7TTO A T/^" 

GLIVIISAIG 

0.135 

569. 

70 

i 

166 

KQLPFCRSNI 

0.121 

570. 

71 

175 

TT CTTP\f/*<T TTA 

ILSHSYCLHQ 

0.120 

571. 

72 

195 

T"» 1 7~K TT ri rr r/-( T TT 7 

RVNVVYGLIV 

0.120 

572. 

73 

38 

T T A T f T 0\TT T'T 

LIAVLGNLTI 

0.120 

573. 

74 

43 

GNLTIIYIVR 

0.108 

574. 

75 

55

HSLHEPMYIF

0.101 

575. 

76 

25

AQFWLAFPLC

0.090 

576. 

77 

240

GTCVSHVCAV

0.090 

577. 

78

29 

LAFPLCSLYL 

0.090 

578. 

79 

259 

LSMVHRFSKR 

0.090 

579. 

80 

275

VILANIYLLV

0.090 

580. 

81 

120

MAFDRYVAIC

0.090 

581. 

82 

243 

VSHVCAVFIF 

0.090 

582. 

83 

91

STTIQFDACL

0.090 

583. 

84 

247 

CAVFIFYVPF 

0.090 

584. 

85 

205

IISAIGLDSL

0.090 

585. 

86 

80

MPKMLAIFWF

0.090 

586. 

87 

284

VPPVLNPIVY

0.080 

587. 

88

230

LTREAQAKAF

0.075 

588. 

89 

155

AALMAPLPVF

0.068

589.

90

242

CVSHVCAVFI

0.060

590.

91

161

LPVFIKQLPF

0.060

591.

92 

117

LLAMAFDRYV

0.060 

592. 

93 

150

AVVRGAALMA

0.060 

593. 

94

142

RVTKIGVAAV

0.060

594. 

95 

244 

SHVCAVFIFY 

0.054 

595. 

96 

180 

YCLHQDVMKL 

0.054 

596. 

97 

46 

TIIYIVRTEH 

0.045 

597. 

98 

241 

TCVSHVCAVF 

0.045 

598. 

99 

92 

TTIQFDACLL 

0.045 

599. 

100 

303 

RILRLFHVAT 

0.045 

600.
TABLE XI:

HLA PEPTIDE SCORING RESULTS -101P3A11- A-1101, 9-MERS

RANK

START
POSITION

SUBSEQUENCE

RESIDUE

LISTING

SCORE (ESTIMATE OF HALF TIME OF
DISASSOCIATION OF A MOLECULE
CONTAINING THIS SUBSEQUENCE)

SEQ.ID#

i 

zzy 

UL 1 KL, At^ AJV 

i ofin 

601. 

Z 

*7A 
/4 

Lib 1 MMrK 

A CAA 

602. 

3 

1 

1 Do 

\A A"DT DXrPTV 

IVlArlJr V r IK 

A <AA 

603. 

A 
4 

0 1 "7 
Zl / 

CtTCVT T TT V 

or 3 i Ll_J.Uv 

A Af\f\ 
U.4UU 

604. 

D 

IOC 

1 Z J 

VTA 7 A TPLIDT "D 

Y V AlCrtr L.K 

A Af\ A 

605. 

£. 

o 

1 1*7 

13 / 

■\/t tt dd \/nrv 
VLlLrKV lis. 

A /I A A 

606. 

/ 

1 lo 

\/T T A TAvT A TT\"D 

V LLAMAr UK 

A 1/CA 

607. 

Q 
O 

1 fiA 
loU 

i V^L>rlv^JJ V IV1K 

A 1AA 

608. 

y 

1 >IC 

140 

IsJUVAAVVK 

A O/IA 
U.Z4U 

609. 

1 A 
1U 

1 £LA 

104 

rn/AT TDT7/~ , T> 

r lKl^LrT LK 

A 1/1A 

U.Z4U 

610. 

1 1 

Zol 

M VrlKr bKKK 

A *>AA 

611. 

1 o 
Iz 

A A 

44 

\TT TTTVR 7TT 

NL111Y1VK 

A 1 ^A 

612. 

13 

O C A 

zoy 

T C X jCA 7TJHD EC T/" 

A 1 OA 

613. | 

1 A 

14 

O/CA 

zoU 

O TV vTA /T_TD TTCT/"D 

A 1 OA 

U.lzU 

614. 

Id 

OQQ 

zy© 

KJilKQKILK 

A 1 AO 

615. 

i it 
lo 

ZOO 

rlCjL5>MVrlK 

A AOA 

U.OoU 

616. 

1 *7 
17 

1 1/1 

134 

TT A TA /T TT T»T> 

HA1 VLILPK 

A AOA 

U.OoU 

617. 

1 o 

lo 

142 

KV IKIOVAA 

n n/cn 
U.060 

618. 

1 A 

19 

1 A*T 

197 

NVVYCjLIVI 

0.060 

619. 

oa 
ZU 

OA1 

zU3 

TA7TTO A T/T 

n n/cn 

620. 

O 1 

zl 

lyD 

t> \ r\.n /a A/r t t 
KVJN V V YOL1 

n n<cn 
U.UoU 

621. 

IT 

zz 

1 fi 

151 

\r\.rr>/~* a a t Ayr a 
V VKuAALMA 

n a /i a 
U.U4U 

622. 

OT 

z3 

1 A C 

24d 

HVtAVrlr Y 

A A /I A 

U-040 

623. 

24 

1 1 

A TV/T7TT T/^T 

Al YrlLICrL 

A f\A A 

0.040 

624. 

o c 

i no 
198 

t r\ T~\r/~* T TA 7TT 

V VYCjLIVII 

A A/A 

0.040 

625. 

O/C 
ZD 

o/i o 
z4o 

AVrir Y Vrr 

A A /I A 

U.U4U 

626. 

OO 

Z / 

TOO 

zoo 

T X1T>TA T\JT^ A /V 

A Ay*A 

627. 

O O 
ZO 

Zdd 

YVrrluLbM 

A f\Af\ 

U.U40 

628. 

zy 

301 

T) /~\T> TT T> T CT_f 

KQKll,lvLr rl 

A A'i 

0.030 

629. 

1 A 

3U 

O 1 

zl 

OLbbAQr WL 

A Al C 

0.036 

630. 

31 

1 c 

ZJ 

A /^V"CA17T A T7T3T 

Al^r WLArrL 

A (\1C 

U.U30 

631. 

11 
3z 

onn 

zyu 

rl V YCjVKIK 

A ATA 

632. 

11 
33 

O/in 
Z4U 

nr<^\ / c TT A /Z^ A 

0 1 C Vorlv W\ 

A ATA 
U.U3U 

633. 

1 A 

34 

zyo 

TS~ T'TV TT ID /"\T> T 

Kl KrilKl^Kl 

A A1A 
U.U3U 

634. i 

3D 

An 
4U 

AA/'T /^XTT TTT 
AVLONL1 11 

A Al A 
V.VDV 

635. 

i£ 
3o 

1 cn 
1 DK) 

AV VKuAALM 

A ATA 

636. 1 

3 / 

1 1 A 

114 

C TA IT J A A H A IT 

A ATA 

637. 

1 c 
3o 

1A£ 

3Uo 

15 T "CXJA/ A TU A 

KLr H V A I rl A 

A AO/1 

U.UZ4 

638. 

-1Q 

jy 

loo 

AC1J1J1K V 

A AO/1 

U.UZ4 

639. 

Af\ 

4U 

O 1 A 

Z 1U 

Vjl^lJoLl^lor 

A AO/1 

U.UZ4 

640. 

/i i 
41 

111 

111 

PA/TCCT\/T T A 

LriVLfco 1 VLLA 

A AO/I 

U.UZ4 

641. 

AO 
4Z 

CA 

T\/DTCUCT TT 

1 VK1 trlbLH 

A AOA 

642. ; 

4.1 
tJ 

94? 

V oXr V v^/-\ v r 

v.UZU 

643. ; 

44 

283 

LVPPVLNPI 

0.020 

644. 

45 

303 

RILRLFHVA 

0.018 

645. 

46 

124 

RYVAICHPL 

0.018 

646. 

47 

201 

GLIVIISAI 

0.018 

647. 

48 

92 

TTIQFDACL 

0.015 

648. 

49 

135 

ATVLTLPRV 

0.015 

649.

50

138

LTLPRVTKI

0.015

650. 

51 

237

KAFGTCVSH

0.012 

651. 

52 

174

NILSHSYCL

0.012

652. 

53 

214

LLISFSYLL

0.012 

653. 

54 

275 

VILANIYLL

0.012 

654. 

55 

222 

LILKTVLGL

0.012 

655. 

56 

294 

GVKTKEIRQ 

0.012 

656. 

57 

143

VTKIGVAAV

0.010 

657. 

58 

230

LTREAQAKA

0.010 

658. 

59 

101

LQIFAIHSL

0.009 

659. 

60 

98

ACLLQIFAI

0.009 

660. 

61 

47

IIYIVRTEH 

0.008 

661. 

62 

279

NIYLLVPPV

0.008 

662. 

63 

56 

SLHEPMYIF 

0.008 

663. 

64

65 

LCMLSGIDI 

0.008 

664. 

65 

67 

MLSGIDILI 

0.008 

665. 

66 

208 

AIGLDSLLI 

0.008 

666. 

67 

224 

LKTVLGLTR 

0.008 

667. 

68 

181 

CLHQDVMKL 

0.008 

668.

69

276

ILANIYLLV

0.008 

669. 

70 

287

VLNPIVYGV

0.008 

670. 

71 

156

ALMAPLPVF

0.008 

671. 

72 

215 

LISFSYLLI 

0.008 

672. 

73 

251 

IFYVPFIGL 

0.008 

673. 

74 

119

AMAFDRYVA 

0.008 

674. 

75 

41 

VLGNLTIIY 

0.008 

675. 

76 

162

PVFIKQLPF

0.008 

676. 

77 

187

MKLACDDIR

0.006 

677. 

78 

293 

YGVKTKEIR

0.006 

678. 

79 

213 

SLLISFSYL 

0.006 

679. 

80 

147

GVAAVVRGA

0.006 

680. 

81

39

IAVLGNLTI

0.006 

681. 

82 

220

YLLILKTVL

0.006 

682. 

83 

155

AALMAPLPV

0.006 

683. 

84 

274

PVILANIYL

0.006

684.

85 

66 

CMLSGIDIL 

0.006 

685. 

86

31

FPLCSLYLI

0.006 

686. 

87

126

VAICHPLRH

0.006 

687. 

88

14

FILIGLPGL

0.006 

688. 

89 

37

YLIAVLGNL

0.006 

689. 


90

99

CLLQIFAIH

0.006 

690. 

91

280

IYLLVPPVL

0.006 

691. 


92

49

YIVRTEHSL

0.006

692.

93 

246 

VCAVFIFYV 

0.006 

693. 

94 

76 

STSSMPKML 

0.005

694. 

95 

225 

KTVLGLTRE 

0.005 

695. 

96 

179 

SYCLHQDVM 

0.004 

696. 

97 

29 

LAFPLCSLY 

0.004 

697. 

98 

167 

QLPFCRSNI 

0.004 

698.

99 

28 

WLAFPLCSL 

0.004 

699. 

100 

117 

LLAMAFDRY 

0.004 

700. 
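
The START POSITION and SUBSEQUENCE columns of these tables are related by a simple sliding window over the protein sequence, and the following minimal sketch illustrates only that relationship. It does not compute the half-time-of-disassociation scores, and the function name `windows` is hypothetical. The fragment used (residues 274-293) is an assumption of this sketch, pieced together from overlapping rows of Tables X and XI rather than taken from a sequence listing.

```python
from typing import Iterator, Tuple


def windows(fragment: str, k: int, first_position: int) -> Iterator[Tuple[int, str]]:
    """Yield (start_position, subsequence) for every window of k residues."""
    for offset in range(len(fragment) - k + 1):
        yield first_position + offset, fragment[offset:offset + k]


# Assumed fragment: residues 274-293, reconstructed from overlapping table rows.
FRAGMENT = "PVILANIYLLVPPVLNPIVY"
FIRST_POSITION = 274

# Map each start position to its 9-mer or 10-mer subsequence.
nine_mers = dict(windows(FRAGMENT, 9, FIRST_POSITION))
ten_mers = dict(windows(FRAGMENT, 10, FIRST_POSITION))

if __name__ == "__main__":
    for start, peptide in sorted(nine_mers.items()):
        print(start, peptide)
```

Under these assumptions, `nine_mers[280]` corresponds to the 9-mer listed at start position 280 in Table XI, and `ten_mers[284]` to the 10-mer listed at start position 284 in Table X.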


TABLE XII: 




HLA PEPTIDE SCORING RESULTS -101P3A11- A-1101, 10-MERS


RANK

START
POSITION

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID# 

1 

258 

GLSMVHRFSK

3.600

701.


2 

136 

TVLTLPRVTK 

3.000 

702.

3 

115 

TVLLAMAFDR 

1.800 

703.

4 

157 

LMAPLPVFIK 

1.200 

704.

5 

73 

ILISTSSMPK 

1.200 

705. 

6 

287 

VLNPIVYGVK 

0.400 

706. 

7 

179 

SYCLHQDVMK 

0.400 

707. 

8 

124 

RYVAICHPLR 

0.360 

708. 

9 

163 

VFIKQLPFCR

0.180 

709. 

10 

223 

ILKTVLGLTR 

0.160 

710.

11 

289 

NPIVYGVKTK 

0.150 

711.

12 

294 

GVKTKEIRQR 

0.120 

712. 

13 

195 

RVNVVYGLIV 

0.120 

713.

14 

292 

VYGVKTKEIR 

0.080 

714. 

15 

216 

ISFSYLLILK 

0.080 

715. 

16 

186 

VMKLACDDIR 

0.080 

716. 

17 

43 

GNLTIIYIVR 

0.072 

717. 

18

147

GVAAVVRGAA

0.060 

718. 

19 

245 

HVCAVFIFYV 

0.060 

719. 

20

142

RVTKIGVAAV

0.060

720.

21 

150 

AVVRGAALMA 

0.060 

721. 

22 

40 

AVLGNLTIIY 

0.060 

722. 

23 

260 

SMVHRFSKRR 

0.060 

723. 

24 

301 

RQRILRLFHV 

0.054 

724.

25 

248 

AVFIFYVPFI 

0.040 

725. 

26 

125 

YVAICHPLRH 

0.040 

726.

27 

228 

LGLTREAQAK 

0.030 

727.

28 

52 

RTEHSLHEPM 

0.030 

728.

29 

240 

GTCVSHVCAV 

0.030 

729.

30 

197 

NVVYGLIVII 

0.030

730. 

31 

166 

KQLPFCRSNI 

0.027

731. 

32 

133 

RHATVLTLPR 

0.024 

732. 

33 

94 

IQFDACLLQI 

0.024 

733. 

34 

111 

GMESTVLLAM 

0.024 

734. 

35 

21 

GLEEAQFWLA 

0.024 

735. 

36 

291 

IVYGVKTKEI 

0.020 

736. 

37 

242 

CVSHVCAVFI 

0.020 

737. 

38 

253 

YVPFIGLSMV 

0.020 

738. 

39 

76 

STSSMPKMLA 

0.020

739. 

40 

283 

LVPPVLNPIV 

0.020 

740. 

41 

250 

FIFYVPFIGL 

0.016 

741.

42 

92 

TTIQFDACLL 

0.015 

742. 

43 

202 

LIVIISAIGL 

0.012 

743. 

44 

275 

VILANIYLLV

0.012 

744. 

45 

221

LLILKTVLGL

0.012 

745. 

46 

229

GLTREAQAKA

0.012 

746. 

47 

214

LLISFSYLLI

0.012 

747. 

48 

18

GLPGLEEAQF

0.012 

748. 

49 

154 

GAALMAPLPV 

0.012 

749. 

50 

66 

CMLSGIDILI 

0.012 

750.

51 

213 

SLLISFSYLL 

0.012 

751. 

52 

252 

FYVPFIGLSM

0.012 

752. 

53 

237 

KAFGTCVSHV 

0.012 

753. 

54 

185 

DVMKLACDDI 

0.012 

754. 

55 

91 

STTIQFDACL 

0.010 

755. 

56 

143 

VTKIGVAAVV 

0.010 

756. 

57 

56 

SLHEPMYIFL 

0.008 

757. 

58 

79 

SMPKMLAIFW 

0.008 

758. 

59 

297 

TKEIRQRILR 

0.008 

759. 

60 

41 

VLGNLTIIYI 

0.008 

760. 

61 

118

LAMAFDRYVA

0.008 

761. 

62 

62 

YIFLCMLSGI 

0.008 

762. 

63 

215 

LISFSYLLIL

0.008 

763. 

64 

64 

FLCMLSGIDI 

0.008 

764. 

65 

199 

VYGLIVIISA 

0.008 

765. 

66 

156 

ALMAPLPVFI 

0.008 

766. 

67 

139 

TLPRVTKIGV 

0.008 

767. 

68 

85 

AIFWFNSTTI 

0.008 

768. 

69 

38 

LIAVLGNLTI 

0.008 

769. 

70 

254 

VPFIGLSMVH 

0.008 

770. 

71 

29 

LAFPLCSLYL 

0.008 

771. 

72 

259 

LSMVHRFSKR 

0.008 

772. 

73 

198 

VVYGLIVIIS 

0.008 

773. 

74 

279 

NIYLLVPPVL 

0.008 

774. 

75 

144

TKIGVAAVVR

0.006 

775. 

76 

255 

PFIGLSMVHR 

0.006 

776. 

77 

207 

SAIGLDSLLI 

0.006 

777. 

78 

36

LYLIAVLGNL

0.006 

778. 

79 

46 

TIIYIVRTEH

0.006 

779. 

80 

286

PVLNPIVYGV

0.006

780.

81 

49 

YIVRTEHSLH 

0.006 

781. 

82 

80 

MPKMLAIFWF

0.006 

782. 

83

161

LPVFIKQLPF

0.006 

783. 

QA 

zo4 


rv r\r\£. 

U.UUo 

784. 

85 

174 

NILSHSYCLH 

0.006 

785. 

86 

116 

VLLAMAFDRY 

0.006 

786. 

87 

219 

SYLLILKTVL 

0.006 

787. 

88

180 

YCLHQDVMKL 

0.006 

788. 

89 

31 

FPLCSLYLIA 

0.006 

789. 

90

48 

IYIVRTEHSL 

0.006 

790.

91 

274 

PVILANIYLL 

0.006 

791. 

92 

273 

LPVILANIYL 

0.006 

792. 

93 

282 

LLVPPVLNPI 

0.006 

793. 

94

234

AQAKAFGTCV

0.006

794. 

95 

82 

KMLAIFWFNS 

0.005 

795. 

96 

298 

KEIRQRILRL 

0.005 

796. 

97 

230 

LTREAQAKAF 

0.005 

797. 

98 

225 

KTVLGLTREA 

0.005 

798. 

99 

108 

SLSGMESTVL 

0.004 

799. 

100 

10 

SATYFILIGL

0.004 

800. 


TABLE XIII: 




HLA PEPTIDE SCORING RESULTS -101P3A11- A24, 9-MERS


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID#

1 

124 

RYVAICHPL 

840.000 

801. 

2 

280 

IYLLVPPVL 

420.000 

802. 

3 

292 

VYGVKTKEI 

55.000 

803. 

4 

30 

AFPLCSLYL 

30.000 

804. 

5 

179 

SYCLHQDVM

25.000 

805. 

6 

251 

IFYVPFIGL 

24.000 

806. 

7 

88 

WFNSTTIQF 

15.000 

807. 

8 

219 

SYLLILKTV 

10.500 

808. 

9 

153 

RGAALMAPL 

9.600 

809. 

10 

252 

FYVPFIGLS 

9.000 

810.

11 

61 

MYIFLCMLS 

9.000 

811.

12 

37 

YLIAVLGNL 

8.400 

812.

13 

36 

LYLIAVLGN 

7.500 

813.

14 

249 

VFIFYVPFI 

7.500 

814. 

15 

63 

IFLCMLSGI 

7.500 

815. 

16 

48 

IYIVRTEHS 

7.500 

816.

17 

213 

SLLISFSYL 

7.200 

817. 

18 

207 

SAIGLDSLL 

7.200 

818. 

19 

220 

YLLILKTVL 

7.200 

819. 

20 

34 

CSLYLIAVL 

7.200 

820. 

21 

21 

GLEEAQFWL 

7.200

821.

22 

110 

SGMESTVLL 

7.200 

822. 

23 

199 

VYGLIVIIS 

7.000 

823.

24 

92 

TTIQFDACL 

6.000 

824.

25 

66 

CMLSGIDIL 

6.000 

825. 

26 

174 

NILSHSYCL 

6.000 

826. 

27 

49 

YIVRTEHSL 

6.000 

827. 

28 

275 

VILANIYLL 

6.000 

828. 

29 

149 

AAVVRGAAL 

6.000 

829. 

30 

101 

LQIFAIHSL 

6.000 

830. 

31 

222 

LILKTVLGL 

6.000 

831. 

32 

130 

HPLRHATVL 

6.000 

832. 

33 

214 

LLISFSYLL 

6.000 

833.

34 

93 

TIQFDACLL 

6.000 

834. 

35 

14 

FILIGLPGL 

6.000 

835. 

36 

203 

IVIISAIGL 

6.000 

836. 

37 

11

ATYFILIGL

5.600

837. 

38 

86 

IFWFNSTTI 

5.000 

838. 

39 

95 

QFDACLLQI 

5.000 

839. 

40 

25 

AQFWLAFPL 

4.800 

840. 

41 

168 

LPFCRSNIL 

4.800 

841. 

42 

181 

CLHQDVMKL 

4.400 

842. 

43 

195 

RVNVVYGLI 

4.200 

843. 

44 

296 

KTKEIRQRI 

4.032 

844. 

45 

265 

FSKRRDSPL 

4.000 

845. 

46 

206 

ISAIGLDSL 

4.000 

846. 

47 

216 

ISFSYLLIL 

4.000 

847. 

48 

28 

WLAFPLCSL 

4.000 

848. 

49 

76 

STSSMPKML 

4.000 

849. 

50 

8 

ESSATYFIL 

4.000 

850. 

51 

109 

LSGMESTVL 

4.000 

851. 

52 

299 

EIRQRILRL 

4.000 

852. 

53 

114 

STVLLAMAF 

3.600 

853. 

54 

283 

LVPPVLNPI 

3.024 

854. 

55 

156 

ALMAPLPVF 

3.000 

855. 

56 

257 

IGLSMVHRF 

3.000 

856. 

57 

6 

GNESSATYF 

3.000 

857. 

58 

79 

SMPKMLAIF 

3.000 

858. 

59 

242 

CVSHVCAVF 

2.800 

859. 

60 

210 

GLDSLLISF 

2.400 

860. 

61 

56 

SLHEPMYIF 

2.400 

861. 

62 

201 

GLIVIISAI 

2.100 

862. 

63 

19 

LPGLEEAQF 

2.000 

863. 

64 

248 

AVFIFYVPF 

2.000 

864. 

65 

138 

LTLPRVTKI 

1.980 

865. 

66 

78 

SSMPKMLAI 

1.800 

866. 

67 

197 

NVVYGLIVI 

1.500 

867. 

68 

98 

ACLLQIFAI 

1.500 

868. 

69 

42 

LGNLTIIYI 

1.500

869. 

70 

55 

HSLHEPMYI 

1.500 

870. 

71 

65 

LCMLSGIDI 

1.500 

871. 

72 

39 

IAVLGNLTI 

1.500 

872. 

73 

167 

QLPFCRSNI 

1.500 

873. 

74 

40 

AVLGNLTII 

1.500 

874. 

75 

31 

FPLCSLYLI 

1.500 

875. 

76

157

LMAPLPVFI

1.440 

876. 

77 

67 

MLSGIDILI

1.400 

877. 

78 

59 

EPMYIFLCM 

1.260 

878. 

79 

120 

MAFDRYVAI 

1.200 

879. 

80 

9 

SSATYFILI 

1.200 

880. 

81 

198 

VVYGLIVII 

1.200 

881. 

82 

269 

RDSPLPVIL

1.152 

882.

83 

253 

YVPFIGLSM 

1.050 

883. 

84 

243 

VSHVCAVFI 

1.000 

884. 

85 

208 

AIGLDSLLI 

1.000 

885. 

86 

186 

VMKLACDDI 

1.000 

886. 

87 

215 

LISFSYLLI 

1.000 

887. 

88 

194 

IRVNVVYGL 

0.840 

888. 

89 

72 

DILISTSSM 

0.750 

889.

90 

163 

VFIKQLPFC 

0.750 

890. 

91 

104 

FAIHSLSGM 

0.750 

891. 

92 

150 

AVVRGAALM 

0.750 

892.

93 

57 

LHEPMYIFL 

0.720 

893. 

94 

12

TYFILIGLP

0.600 

894. 

95 

160 

PLPVFIKQL 

0.600 

895. 

96 

26 

QFWLAFPLC 

0.600 

896. 

97 

297 

TKEIRQRIL 

0.600 

897. 

98 

274 

PVILANIYL 

0.600 

898. 

99 

75 

ISTSSMPKM 

0.550 

899. 

100 

82 

KMLAIFWFN 

0.504 

900. 


TABLE XIV: 




HLA PEPTIDE SCORING RESULTS -101P3A11- A24, 10-MERS 


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID# 

1 

36 

LYLIAVLGNL 

420.000 

901. 

2 

219 

SYLLILKTVL 

360.000 

902.

3 

48 

IYIVRTEHSL 

300.000 

903. 

4 

252 

FYVPFIGLSM 

63.000 

904. 

5 

264 

RFSKRRDSPL 

40.000 

905. 

6 

13 

YFILIGLPGL 

30.000 

906. 

7 

95 

QFDACLLQIF 

14.400 

907. 

8 

296 

KTKEIRQRIL 

9.600 

908. 

9 

30 

AFPLCSLYLI 

7.500 

909. 

10 

280 

IYLLVPPVLN 

7.500 

910. 

11 

24 

EAQFWLAFPL 

7.200 

911. 

12 

167 

QLPFCRSNIL 

7.200 

912.

13 

159 

APLPVFIKQL 

7.200 

913. 

14 

212 

DSLLISFSYL 

7.200 

914. 

15 

199 

VYGLIVIISA 

7.000 

915. 

16 

180 

YCLHQDVMKL 

6.600 

916. 

17 

65 

LCMLSGIDIL 

6.000 

917. 

18 

92 

TTIQFDACLL 

6.000 

918. 

19 

27 

FWLAFPLCSL 

6.000 

919. 

20 

213 

SLLISFSYLL 

6.000 

920. 

21 

59 

EPMYIFLCML 

6.000 

921. 

22 

273 

LPVILANIYL 

6.000 

922. 

23 

221 

LLILKTVLGL 

6.000 

923. 

24 

202 

LIVIISAIGL 

6.000 

924.

25 

173 

SNILSHSYCL 

6.000 

925. 

26 

100 

LLQIFAIHSL 

6.000 

926. 

27 

279 

NIYLLVPPVL

5.600 

927. 

28 

193 

DIRVNVVYGL

5.600 

928. 

29 

10 

SATYFILIGL 

5.600 

929. 

30 

250 

FIFYVPFIGL 

4.800 

930. 

31 

29 

LAFPLCSLYL 

4.800 

931. 

32

56

SLHEPMYIFL

4.800 

932. 

33 

33

LCSLYLIAVL

4.800 

933. 

34 

206

ISAIGLDSLL

4.800

934.

35 

209 

IGLDSLLISF 

4.320 

935. 

36 

241 

TCVSHVCAVF 

4.200 

936. 

37 

215

LISFSYLLIL

4.000 

937. 

38 

108 

SLSGMESTVL 

4.000 

938. 

39 

148

VAAVVRGAAL

4.000 

939. 

40 

205 

IISAIGLDSL 

4.000 

940. 

41 

91 

STTIQFDACL 

4.000 

941. 

42 

75 

ISTSSMPKML 

4.000 

942. 

43 

109 

LSGMESTVLL 

4.000 

943. 

44 

166 

KQLPFCRSNI 

3.600 

944. 

45 

78 

SSMPKMLAIF 

3.600 

945. 

46 

5 

NGNESSATYF 

3.600 

946.

47 

282

LLVPPVLNPI 

3.024 

947. 

48 

55 

HSLHEPMYIF

3.000 

948. 

49 

161 

LPVFIKQLPF

3.000 

949. 

50 

155 

AALMAPLPVF 

3.000 

950. 

51 

247 

CAVFIFYVPF 

3.000 

951. 

52 

18 

GLPGLEEAQF 

3.000 

952. 

53 

299 

EIRQRILRLF 

2.800 

953. 

54 

271 

SPLPVILANI 

2.520 

954. 

55 

103 

IFAIHSLSGM 

2.500 

955. 

56 

230 

LTREAQAKAF 

2.400 

956. 

57 

113 

ESTVLLAMAF 

2.400 

957.

58 

66 

CMLSGIDILI 

2.100 

958.

59 

200

YGLIVIISAI

2.100

959.

60 

256 

FIGLSMVHRF 

2.000 

960. 

61 

243

VSHVCAVFIF

2.000 

961. 

62 

87 

FWFNSTTIQF 

2.000 

962. 

63 

80 

MPKMLAIFWF 

2.000 

963. 

64 

156

ALMAPLPVFI

1.800 

964. 

65 

52

RTEHSLHEPM

1.800 

965. 

66 

197

NVVYGLIVII

1.800 

966. 

67

6

GNESSATYFI

1.500

967. 

68 

124 

RYVAICHPLR 

1.500 

968. 

69 

214 

LLISFSYLLI 

1.500 

969. 

70 

207 

SAIGLDSLLI 

1.500 

970. 

71 

196 

VNVVYGLIVI 

1.500 

971. 

72 

185

DVMKLACDDI 

1.500 

972. 

73 

39 

IAVLGNLTII 

1.500 

973.

74

94

IQFDACLLQI

1.200 

974. 

/J 


T/"CrD /"\T> TT T> T 

1.200 

975. 

/o 

Q i 
8 ! 


1.200 

976. 

77

291

IVYGVKTKEI

1.100 

977. 

78

137

VLTLPRVTKI

1.100 

978. 

79 

111 

GMESTVLLAM

1.050 

979. 

80

85

AIFWFNSTTI

1.000 

980. 

81

41

VLGNLTIIYI

1.000 

981. 

82 

97

DACLLQIFAI

1.000

982.

83 

119

AMAFDRYVAI

1.000

983.

84 

77

TSSMPKMLAI

1.000

984.

85

38

LIAVLGNLTI 

1.000 

985. 

86 

248

AVFIFYVPFI

1.000

986. 

87

64

FLCMLSGIDI

1.000 

987. 

88

62 

YIFLCMLSGI 

1.000 

988. 

89

242 

CVSHVCAVFI 

1.000 

989. 

90

268 

RRDSPLPVIL 

0.960 

990. 

91 

61 

MYIFLCMLSG 

0.750 

991. 

92 

149 

AAVVRGAALM

0.750 

992. 

93 

20

PGLEEAQFWL

0.720 

993. 


94

12

TYFILIGLPG

0.700 

994. 

95 

238

AFGTCVSHVC 

0.700 

995. 

96 

217 

SFSYLLILKT 

0.660 

996. 

97 

129

CHPLRHATVL 

0.600 

997. 

98 

274 

PVILANIYLL 

0.600 

998. 

99 

123 

DRYVAICHPL 

0.560 

999. 

100 

74 

LISTSSMPKM 

0.550 

1000. 


TABLE XV: 




HLA PEPTIDE SCORING RESULTS -101P3A11- B7, 9-MERS 


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID# 

1 

130 

HPLRHATVL 

80.000 

1001. 

2 

168 

LPFCRSNIL 

80.000 

1002. 

3 

59 

EPMYIFLCM 

60.000 

1003. 

4 

299 

EIRQRILRL 

40.000 

1004. 

5 

140 

LPRVTKIGV 

40.000 

1005. 

6 

149 

AAVVRGAAL

36.000 

1006. 

7 

203 

IVIISAIGL 

20.000 

1007. 

8 

150 

AVVRGAALM 

15.000 

1008. 

9 

110 

SGMESTVLL 

12.000 

1009. 

10 

11 

ATYFILIGL 

12.000 

1010. 

11 

25 

AQFWLAFPL 

12.000 

1011. 

12 

207 

SAIGLDSLL 

12.000 

1012. 

13 

31 

FPLCSLYLI 

8.000 

1013. 

14 

28 

WLAFPLCSL 

6.000 

1014. 

15 

40 

AVLGNLTII 

6.000 

1015. 

16 

151 

VVRGAALMA 

5.000 

1016.

17 

253 

YVPFIGLSM 

5.000 

1017. 

18 

109 

LSGMESTVL 

4.000 

1018. 

19 

275 

VILANIYLL 

4.000 

1019. 

20 

216 

ISFSYLLIL 

4.000 

1020. 

21 

220 

YLLILKTVL 

4.000 

1021. 

22 

181 

CLHQDVMKL 

4.000 

1022. 

23 

254 

VPFIGLSMV 

4.000 

1023. 

24 

222 

LILKTVLGL 

4.000 

1024. 

25 

93 

TIQFDACLL 

4.000 

1025. 

26 

76 

STSSMPKML 

4.000 

1026. 

27 

49 

YIVRTEHSL

4.000 

1027. 

28 

153 

RGAALMAPL 

4.000 

1028. 

29 

92 

TTIQFDACL 

4.000 

1029. 

30 

14 

FILIGLPGL 

4.000 

1030. 

31 

101 

LQIFAIHSL 

4.000 

1031. 

32 

213 

SLLISFSYL 

4.000

1032. 

33 

174 

NILSHSYCL 

4.000 

1033. 

34 

265 

FSKRRDSPL 

4.000 

1034. 

35 

37 

YLIAVLGNL 

4.000 

1035. 

36 

284 

VPPVLNPIV 

4.000 

1036. 

37 

214 

LLISFSYLL 

4.000 

1037. 

38 

34 

CSLYLIAVL 

4.000 

1038. 

39 

66 

CMLSGIDIL 

4.000 

1039. 

40 

206 

ISAIGLDSL 

4.000 

1040. 

41 

8 

ESSATYFIL 

4.000 

1041. 

42 

104 

FAIHSLSGM 

3.000 

1042. 

43 

283 

LVPPVLNPI 

2.000 

1043. 

44 

289 

NPIVYGVKT 

2.000 

1044. 

45 

195 

RVNVVYGLI 

2.000 

1045. 

46 

197 

NVVYGLIVI 

2.000 

1046. 

47 

3 

DPNGNESSA 

2.000 

1047. 

48 

198 

VVYGLIVII 

2.000 

1048. 

49 

274 

PVILANIYL

2.000 

1049. 

50 

118 

LAMAFDRYV 

1.800 

1050. 

51 

155 

AALMAPLPV 

1.800 

1051. 

52 

39 

IAVLGNLTI 

1.200 

1052. 

53 

21 

GLEEAQFWL 

1.200 

1053. 

54 

98 

ACLLQIFAI 

1.200 

1054. 

55 

120 

MAFDRYVAI 

1.200 

1055. 

56 

208

AIGLDSLLI

1.200 

1056. 

57 

78 

SSMPKMLAI 

1.200 

1057. 

58 

65 

LCMLSGIDI 

1.200 

1058. 

59

30

AFPLCSLYL

1.200 

1059. 

60 

304 

ILRLFHVAT 

1.000 

1060. 

61 

72 

DILISTSSM 

1.000 

1061. 

62 

75 

ISTSSMPKM 

1.000 

1062. 

63 

230 

LTREAQAKA 

1.000 

1063. 

64 

235 

QAKAFGTCV 

0.600

1064. 

65 

251 

IFYVPFIGL 

0.600 

1065.

66 

167 

QLPFCRSNI 

0.600 

1066. 

67 

296 

KTKEIRQRI 

0.600 

1067. 

68 

135

ATVLTLPRV

0.600 

1068. 

69 

159 

APLPVFIKQ 

0.600 

1069. 

70 

147 

GVAAVVRGA

0.500 

1070. 

71 

50

IVRTEHSLH

0.500 

1071. 

72 

136

TVLTLPRVT

0.500 

1072. 

73 

226 

TVLGLTREA 

0.500 

1073. 

74 

142

RVTKIGVAA

0.500 

1074. 

75 

119 

AMAFDRYVA 

0.450 

1075. 

76 

127

AICHPLRHA

0.450 

1076. 

77 

148

VAAVVRGAA

0.450 

1077. 

78 

273 

LPVILANIY 

0.400 

1078. 

79 

271

SPLPVILAN

0.400 

1079. 

80 

215 

LISFSYLLI 

0.400 

1080. 

81 

269 

RDSPLPVIL 

0.400 

1081. 

82 

80 

MPKMLAIFW

0.400 

1082. 

83 

124 

RYVAICHPL 

0.400 

1083. 

84 

9 

SSATYFILI 

0.400 

1084. 

85 

42 

LGNLTIIYI 

0.400 

1085. 

86 

60 

PMYIFLCML 

0.400 

1086. 

87 

280 

IYLLVPPVL 

0.400 

1087. 

88 

67 

MLSGIDILI

0.400 

1088. 

89 

132 

LRHATVLTL 

0.400 

1089. 

90 

55 

HSLHEPMYI 

0.400 

1090. 

91 

157 

LMAPLPVFI 

0.400 

1091. 

92 

138 

LTLPRVTKI 

0.400 

1092. 

93 

160 

PLPVFIKQL 

0.400 

1093.

94 

19 

LPGLEEAQF

0.400

1094.

95 

194 

IRVNVVYGL

0.400

1095.

96 

186 

VMKLACDDI 

0.400 

1096.

97 

201 

GLIVIISAI 

0.400 

1097. 

98 

243 

VSHVCAVFI 

0.400 

1098. 

99 

85 

AIFWFNSTT 

0.300 

1099. 

100 

248 

AVFIFYVPF 

0.300 

1100. 


TABLE XVI: 




HLA PEPTIDE SCORING RESULTS -101P3A11- B7, 10-MERS 


RANK 

START 
POSITION 

SUBSEQUENCE 

RESIDUE 

LISTING 

SCORE (ESTIMATE OF HALF TIME OF 
DISASSOCIATION OF A MOLECULE 
CONTAINING THIS SUBSEQUENCE) 

SEQ.ID#

1 

59 

EPMYIFLCML 

240.000 

1101. 

2 

159 

APLPVFIKQL 

240.000 

1102. 

3 

273 

LPVILANIYL 

80.000 

1103. 

4 

193 

DIRVNVVYGL 

40.000 

1104. 

5 

140 

LPRVTKIGVA 

20.000 

1105. 

6 

65 

LCMLSGIDIL 

12.000 

1106. 

7 

10 

SATYFILIGL 

12.000 

1107. 

8 

24 

EAQFWLAFPL 

12.000 

1108. 



9

29

LAFPLCSLYL

12.000 

1109. 

10

148

VAAVVRGAAL

12.000 

1110. 

11

149

AAVVRGAALM

9.000 

1111. 

12

271

SPLPVILANI

8.000 

1112. 

13 

185

DVMKLACDDI 

6.000 

1113. 

14 

250

FIFYVPFIGL

6.000 

1114. 

15 

248

AVFIFYVPFI 

6.000 

1115. 

16 

108

SLSGMESTVL 

4.000 

1116. 

17 

91

STTIQFDACL

4.000 

1117. 

18

92

TTIQFDACLL 

4.000 

1118. 

19 

206

ISAIGLDSLL

4.000 

1119. 

20

202

LIVIISAIGL

4.000 

1120. 

21 

296

KTKEIRQRIL

4.000 

1121. 

22 

212 

DSLLISFSYL 

4.000 

1122. 

23 

205 

IISAIGLDSL 

4.000 

1123. 

24 

131

PLRHATVLTL

4.000 

1124. 

25 

100

LLQIFAIHSL

4.000 

1125. 

26 

109 

LSGMESTVLL 

4.000 

1126. 

27 

33

LCSLYLIAVL

4.000 

1127. 

28 

221

LLILKTVLGL

4.000 

1128. 

29 

56

SLHEPMYIFL

4.000 

1129. 

30 

215

LISFSYLLIL

4.000 

1130. 

31 

173 

SNILSHSYCL 

4.000 

1131. 

32 

167 

QLPFCRSNIL 

4.000 

1132. 

33 

279

NIYLLVPPVL

4.000 

1133. 

34 

75 

ISTSSMPKML 

4.000 

1134. 

35 

213 

SLLISFSYLL 

4.000 

1135. 

36 

180 

YCLHQDVMKL 

4.000 

1136. 

37 

156

ALMAPLPVFI

3.600 

1137. 

38 

274

PVILANIYLL

2.000 

1138. 

39 

301

RQRILRLFHV

2.000 

1139. 

40 

197

NVVYGLIVII

2.000 

1140. 

41 

130

HPLRHATVLT

2.000 

1141. 

42 

31 

FPLCSLYLIA 

2.000 

1142. 

43 

242

CVSHVCAVFI

2.000 

1143. 

44

291

IVYGVKTKEI

2.000 

1144. 

45

3

DPNGNESSAT

2.000 

1145. 

46

150

AVVRGAALMA

1.500 

1146. 

47

118

LAMAFDRYVA

1.350 

1147. 

48

39

IAVLGNLTII

1.200 

1148. 

49 

97

DACLLQIFAI

1.200 

1149. 

50

85 

AIFWFNSTTI 

1.200 

1150. 

51

119

AMAFDRYVAI

1.200

1151. 

52 

207 

SAIGLDSLLI 

1.200 

1152. 

53 

195 

RVNVVYGLIV 

1.000 

1153. 

54 

178 

HSYCLHQDVM 

1.000 

1154. 

55 

253 

YVPFIGLSMV 

1.000 

1155. 

56 

142 

RVTKIGVAAV 

1.000 

1156. 

57 

283 

LVPPVLNPIV 

1.000

1157.

58 

245 

HVCAVFIFYV 

1.000 

1158. 

59 

74 

LISTSSMPKM 

1.000 

1159. 

60 

189 

LACDDIRVNV 

0.900 

1160. 

61 

147 

GVAAVVRGAA

0.750 

1161. 

62 

27 

FWLAFPLCSL 

0.600 

1162. 

63 

154 

GAALMAPLPV 

0.600 

1163. 

64 

278 

ANIYLLVPPV 

0.600 

1164. 

65 

166 

KQLPFCRSNI 

0.600 

1165. 

66 

134 

HATVLTLPRV 

0.600 

1166. 

67 

267 

KRRDSPLPVI 

0.600 

1167. 

68 

237 

KAFGTCVSHV 

0.600 

1168. 

69 

234 

AQAKAFGTCV 

0.600 

1169. 

70 

50 

IVRTEHSLHE 

0.500 

1170. 

71 

151 

VVRGAALMAP

0.500 

1171. 

72 

127 

AICHPLRHAT 

0.450 

1172. 

73 

126 

VAICHPLRHA 

0.450 

1173. 

74 

36 

LYLIAVLGNL 

0.400 

1174. 

75 

7 

NESSATYFIL 

0.400 

1175. 

76 

129 

CHPLRHATVL 

0.400 

1176. 

77 

214 

LLISFSYLLI 

0.400 

1177. 

78 

80 

MPKMLAIFWF 

0.400 

1178. 

79 

66 

CMLSGIDILI 

0.400 

1179. 

80 

77 

TSSMPKMLAI 

0.400 

1180. 

81 

168 

LPFCRSNILS 

0.400 

1181. 

82 

137 

VLTLPRVTKI 

0.400 

1182. 

83 

19 

LPGLEEAQFW 

0.400 

1183. 

84 

8 

ESSATYFILI 

0.400 

1184.

85 

123 

DRYVAICHPL 

0.400 

1185. 

86 

298 

KEIRQRILRL

0.400 

1186. 

87 

20 

PGLEEAQFWL 

0.400 

1187. 

88 

94 

IQFDACLLQI 

0.400 

1188. 

89 

284 

VPPVLNPIVY 

0.400 

1189. 

90 

196 

VNVVYGLIVI 

0.400 

1190. 

91 

41 

VLGNLTIIYI 

0.400 

1191. 

92 

264 

RFSKRRDSPL 

0.400 

1192. 

93 

38 

LIAVLGNLTI 

0.400 

1193. 

94 

282 

LLVPPVLNPI 

0.400 

1194. 

95 

48 

IYIVRTEHSL 

0.400 

1195. 

96

13 

YFILIGLPGL 

0.400 

1196. 

97

62 

YIFLCMLSGI 

0.400 

1197. 

98

64 

FLCMLSGIDI 

0.400 

1198. 

99 

161 

LPVFIKQLPF 

0.400 

1199. 

100 

152 

VRGAALMAPL 

0.400 

1200.

TABLE XVII:

HLA PEPTIDE SCORING RESULTS - 101P3A11 - B-3501, 9-MERS

RANK

START POSITION

SUBSEQUENCE RESIDUE LISTING

SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE)

SEQ. ID#

1 

273 

LPVILANIY 

40.000 

1201. 

2 

59 

EPMYIFLCM 

40.000 

1202. 

3 

19 

LPGLEEAQF 

30.000 

1203. 

4 

80 

MPKMLAIFW 

30.000 

1204. 

5 

130 

HPLRHATVL 

20.000 

1205. 

6 

172 

RSNILSHSY 

20.000 

1206. 

7 

168 

LPFCRSNIL 

20.000 

1207. 

8 

265 

FSKRRDSPL 

15.000 

1208. 

9 

140 

LPRVTKIGV 

12.000 

1209. 

10 

212 

DSLLISFSY 

10.000 

1210. 

11 

75 

ISTSSMPKM 

10.000 

1211. 

12 

31 

FPLCSLYLI 

8.000 

1212. 

13 

109 

LSGMESTVL 

7.500 

1213. 

14 

29 

LAFPLCSLY 

6.000 

1214. 

15 

104 

FAIHSLSGM 

6.000 

1215. 

16 

34 

CSLYLIAVL 

5.000 

1216. 

17 

8 

ESSATYFIL

5.000 

1217. 

18 

206 

ISAIGLDSL 

5.000 

1218. 

19 

216 

ISFSYLLIL

5.000 

1219. 

20 

296 

KTKEIRQRI 

4.800 

1220. 

21 

284 

VPPVLNPIV 

4.000 

1221. 

22 

5 

NGNESSATY 

4.000 

1222. 

23 

254 

VPFIGLSMV 

4.000 

1223. 

24 

285 

PPVLNPIVY 

4.000 

1224. 

25 

299 

EIRQRILRL 

3.000 

1225. 

26 

55 

HSLHEPMYI 

3.000 

1226. 

27 

149 

AAVVRGAAL 

3.000 

1227. 

28 

207 

SAIGLDSLL 

3.000 

1228. 

29 

120 

MAFDRYVAI 

2.400 

1229. 

30 

110 

SGMESTVLL 

2.000

1230. 

31 

150 

AVVRGAALM 

2.000 

1231. 

32 

56 

SLHEPMYIF 

2.000 

1232. 

33 

78 

SSMPKMLAI 

2.000 

1233. 

34 

117

LLAMAFDRY 

2.000 

1234. 

35 

245 

HVCAVFIFY 

2.000 

1235. 

36 

243 

VSHVCAVFI 

2.000 

1236. 

37 

41 

VLGNLTIIY 

2.000 

1237. 

38 

271 

SPLPVILAN 

2.000 

1238. 

39 

3 

DPNGNESSA 

2.000 

1239. 

40 

72 

DILISTSSM 

2.000 

1240. 

41 

153 

RGAALMAPL 

2.000 

1241. 

42 

253 

YVPFIGLSM 

2.000 

1242. 


43

9

SSATYFILI

2.000

1243. 

44 

289 

NPIVYGVKT 

2.000 

1244. 

45 

235 

QAKAFGTCV 

1.800 

1245. 

46 

181 

CLHQDVMKL 

1.500 

1246. 

47 

93 

TIQFDACLL 

1.500 

1247. 

48 

39 

IAVLGNLTI 

1.200 

1248. 

49 

186 

VMKLACDDI 

1.200

1249.

50 

275 

VILANIYLL

1.000 

1250. 

51 

66 

CMLSGIDIL

1.000 

1251. 

52 

49 

YIVRTEHSL 

1.000 

1252. 

53 

242 

CVSHVCAVF 

1.000 

1253. 

54 

203 

IVIISAIGL 

1.000 

1254. 

55 

14 

FILIGLPGL 

1.000 

1255. 

56 

222 

LILKTVLGL

1.000 

1256. 

57 

25 

AQFWLAFPL 

1.000 

1257. 

58 

214 

LLISFSYLL 

1.000 

1258. 

59 

114

STVLLAMAF 

1.000 

1259. 

60 

101

LQIFAIHSL 

1.000 

1260. 

61 

92 

TTIQFDACL 

1.000 

1261. 

62 

257 

IGLSMVHRF 

1.000 

1262. 

63 

28 

WLAFPLCSL 

1.000 

1263. 

64 

79 

SMPKMLAIF 

1.000 

1264. 

65 

178 

HSYCLHQDV 

1.000 

1265. 

66 

37 

YLIAVLGNL 

1.000 

1266. 

67 

174 

NILSHSYCL 

1.000 

1267. 

68 

156 

ALMAPLPVF 

1.000 

1268. 

69 

213 

SLLISFSYL 

1.000 

1269. 

70 

220 

YLLILKTVL 

1.000 

1270. 

71 

248 

AVFIFYVPF 

1.000 

1271. 

72 

76 

STSSMPKML 

1.000 

1272. 

73 

11 

ATYFILIGL 

1.000 

1273. 

74 

189 

LACDDIRVN 

0.900 

1274. 

75 

195 

RVNVVYGLI

0.800 

1275. 

76 

68 

LSGIDILIS 

0.750 

1276. 

77 

118 

LAMAFDRYV 

0.600 

1277. 

78 

143 

VTKIGVAAV 

0.600 

1278. 

79 

21 

GLEEAQFWL 

0.600 

1279. 

80 

188 

KLACDDIRV 

0.600 

1280. 

81 

208 

AIGLDSLLI 

0.600 

1281. 

82 

155 

AALMAPLPV 

0.600 

1282. 

83 

230 

LTREAQAKA 

0.600 

1283. 

84 

113 

ESTVLLAMA 

0.500 

1284. 

85 

90 

NSTTIQFDA 

0.500 

1285. 

86 

218 

FSYLLILKT 

0.500 

1286. 

87 

77 

TSSMPKMLA 

0.500 

1287. 

88 

107 

HSLSGMEST 

0.500 

1288.

89

270

DSPLPVILA

0.500 

1289. 

90 

167 

QLPFCRSNI 

0.400 

1290. 

91

198

VVYGLIVII 

0.400 

1291. 


92

201

GLIVIISAI

0.400

1292. 

93 

197 

NVVYGLIVI 

0.400 

1293. 

94 

98 

ACLLQIFAI 

0.400 

1294. 

95 

67 

MLSGIDILI 

0.400 

1295.

96 

138 

LTLPRVTKI 

0.400 

1296.

97 

157 

LMAPLPVFI 

0.400 

1297.

98 

283 

LVPPVLNPI 

0.400 

1298.

99 

40 

AVLGNLTII 

0.400 

1299. 

100 

215 

LISFSYLLI 

0.400 

1300. 
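
The START POSITION and SUBSEQUENCE RESIDUE LISTING columns of the 9-mer and 10-mer tables can be reproduced by sliding a fixed-width window along the protein. The following is a minimal sketch only: the function name `windows` and the choice of fragment (residues 3-30, as spelled out by the peptides listed in these tables) are illustrative assumptions, and the binding scores themselves are not computed here.

```python
def windows(fragment, k, first_position=1):
    """Yield (start_position, subsequence) for every k-residue window.

    first_position is the 1-based position, in the full-length protein,
    of the first residue of `fragment`.
    """
    for offset in range(len(fragment) - k + 1):
        yield first_position + offset, fragment[offset:offset + k]


# Residues 3-30 of 101P3A11, as spelled out by the tabulated peptides.
FRAGMENT = "DPNGNESSATYFILIGLPGLEEAQFWLA"

nonamers = dict(windows(FRAGMENT, 9, first_position=3))
decamers = dict(windows(FRAGMENT, 10, first_position=3))

print(nonamers[3], nonamers[19])   # DPNGNESSA LPGLEEAQF
print(decamers[19])                # LPGLEEAQFW
```

Each printed peptide matches the entry tabulated at the same start position.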


TABLE XVIII:

HLA PEPTIDE SCORING RESULTS - 101P3A11 - B-3501, 10-MERS

RANK

START POSITION

SUBSEQUENCE RESIDUE LISTING

SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION OF A MOLECULE CONTAINING THIS SUBSEQUENCE)

SEQ. ID#


1

80

MPKMLAIFWF

60.000

1301.

2

284

VPPVLNPIVY

40.000

1302.

3

273

LPVILANIYL

20.000

1303.

4

159

APLPVFIKQL

20.000

1304.

5

59

EPMYIFLCML

20.000

1305.

6

161

LPVFIKQLPF

20.000

1306.

7

19

LPGLEEAQFW

15.000

1307.

8

296

KTKEIRQRIL

12.000

1308.

9

178

HSYCLHQDVM

10.000

1309.

10

271

SPLPVILANI

8.000

1310.

11

140

LPRVTKIGVA

6.000

1311.

12

230

LTREAQAKAF

6.000

1312.

13

149

AAVVRGAALM

6.000

1313.

14

206

ISAIGLDSLL

5.000

1314.

15

75

ISTSSMPKML

5.000

1315.

16

243

VSHVCAVFIF

5.000

1316.

17

212

DSLLISFSYL

5.000

1317.

18

113

ESTVLLAMAF

5.000

1318.

19

78

SSMPKMLAIF

5.000

1319.

20

55

HSLHEPMYIF

5.000

1320.

21

109

LSGMESTVLL

5.000

1321.

22

10

SATYFILIGL

3.000

1322.

23

193

DIRVNVVYGL

3.000

1323.

24

155

AALMAPLPVF

3.000

1324.

25

299

EIRQRILRLF

3.000

1325.

26

3

DPNGNESSAT

3.000

1326.

27

148

VAAVVRGAAL

3.000

1327.

28

29

LAFPLCSLYL

3.000

1328.

29 

24 

EAQFWLAFPL 

3.000 

1329. 

30 

247 

CAVFIFYVPF 

3.000 

1330. 

31 

56 

SLHEPMYIFL 

2.000 

1331. 

32 

74 

LISTSSMPKM 

2.000 

1332. 

33 

130 

HPLRHATVLT 

2.000 

1333. 

34 

77 

TSSMPKMLAI 

2.000 

1334. 

35 

5 

NGNESSATYF 

2.000 

1335. 

36 

116 

VLLAMAFDRY 

2.000 

1336. 

37 

28 

WLAFPLCSLY 

2.000 

1337. 

38 

31 

FPLCSLYLIA 

2.000 

1338. 

39 

168 

LPFCRSNILS 

2.000 

1339. 

40 

8 

ESSATYFILI 

2.000 

1340. 

41 

40 

AVLGNLTIIY 

2.000 

1341.

42

209

IGLDSLLISF

2.000

1342.

43

207

SAIGLDSLLI

1.800

1343.

44

18

GLPGLEEAQF

1.500

1344.

45

108

SLSGMESTVL

1.500

1345.

46

180

YCLHQDVMKL

1.500

1346.

47

92

TTIQFDACLL

1.500

1347.

48

237

KAFGTCVSHV

1.200

1348.

49

97

DACLLQIFAI

1.200

1349.

50

39

IAVLGNLTII

1.200

1350.

51

301

RQRILRLFHV

1.200

1351.

52

52

RTEHSLHEPM

1.200

1352.

53

189

LACDDIRVNV

1.200

1353.

54

218

FSYLLILKTV

1.000

1354.

55

241

TCVSHVCAVF

1.000

1355.

56

215

LISFSYLLIL

1.000

1356.

57

33

LCSLYLIAVL

1.000

1357.

58

202

LIVIISAIGL

1.000

1358.

59

221

LLILKTVLGL

1.000

1359.

60

172

RSNILSHSYC

1.000

1360.

61

100

LLQIFAIHSL

1.000

1361.

62 

173 

SNILSHSYCL 

1.000 

1362. 

63

279

NIYLLVPPVL

1.000

1363.

64

107

HSLSGMESTV

1.000

1364.

65

65

LCMLSGIDIL

1.000

1365.

66

205

IISAIGLDSL

1.000

1366.

67

91

STTIQFDACL

1.000

1367.

68

250

FIFYVPFIGL

1.000

1368.

69

213

SLLISFSYLL

1.000

1369.

70

256

FIGLSMVHRF

1.000

1370.

71

167

QLPFCRSNIL

1.000

1371.

72

235

QAKAFGTCVS

0.900

1372.

73

94

IQFDACLLQI

0.800

1373.

74

166

KQLPFCRSNI

0.800

1374.

75

134

HATVLTLPRV

0.600

1375.

76

111

GMESTVLLAM

0.600

1376.

77

154

GAALMAPLPV

0.600

1377.

78

143

VTKIGVAAVV

0.600

1378.

79

120

MAFDRYVAIC

0.600

1379.

80

68

LSGIDILIST

0.500

1380.

81

90

NSTTIQFDAC

0.500

1381.

82

79

SMPKMLAIFW

0.500

1382.

83

270

DSPLPVILAN

0.500

1383.

84

267

KRRDSPLPVI

1384.

85 

118 

LAMAFDRYVA 

0.450 

1385. 

86 

291 

IVYGVKTKEI 

0.400 

1386. 

87 

282 

LLVPPVLNPI 

0.400 

1387. 

88 

242 

CVSHVCAVFI 

0.400 

1388. 

89 

64 

FLCMLSGIDI 

0.400 

1389. 

90 

85 

AIFWFNSTTI 

0.400 

1390.

91

142

RVTKIGVAAV

0.400

1391.

92

137

VLTLPRVTKI

0.400

1392.

93

38

LIAVLGNLTI

0.400

1393.

94

62

YIFLCMLSGI

0.400

1394.


95 

214 

LLISFSYLLI 

0.400 

1395. 

96 

156 

ALMAPLPVFI 

0.400 

1396. 

97 

41 

VLGNLTIIYI

0.400 

1397. 

98 

196 

VNVVYGLIVI

0.400 

1398. 

99 

248 

AVFIFYVPFI 

0.400 

1399. 

100 

200 

YGLIVIISAI

0.400 

1400. 


Table XIX: Motifs and Post-translational Modifications of 101P3A11

N-glycosylation site
Number of matches: 3

1 7-10 NESS (SEQ ID NO: 2881)

2 44-47 NLTI (SEQ ID NO: 2882)

3 90-93 NSTT (SEQ ID NO: 2883)

cAMP- and cGMP-dependent protein kinase phosphorylation site

268-271 RRDS (SEQ ID NO: 2884)

Protein kinase C phosphorylation site

266-268 SKR (SEQ ID NO: 2885)

Casein kinase II phosphorylation site
Number of matches: 3

1 56-59 SLHE (SEQ ID NO: 2886)

2 69-72 SGID (SEQ ID NO: 2887)

3 110-113 SGME (SEQ ID NO: 2888)

N-myristoylation site
Number of matches: 4

1 6-11 GNESSA (SEQ ID NO: 2889)

2 21-26 GLEEAQ (SEQ ID NO: 2890)

3 111-116 GMESTV (SEQ ID NO: 2891)

4 240-245 GTCVSH

G-protein coupled receptors family 1 signature

112-128 MESTVLLAMAFDRYVAI (SEQ ID NO: 2892)
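
Short site motifs of this kind are commonly located by matching a consensus pattern against the protein sequence. The sketch below is illustrative and is not the program used to produce the table: the two regular expressions are assumed renderings of the usual PROSITE-style patterns for N-glycosylation and N-myristoylation, the function name `scan` is hypothetical, and the fragment covers only residues 3-30, so sites further along the protein are not found.

```python
import re

# PROSITE-style patterns written as regular expressions (assumed forms):
#   N-glycosylation:   N-{P}-[ST]-{P}
#   N-myristoylation:  G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}
PATTERNS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "N-myristoylation": r"G[^EDRKHPFYW].{2}[STAGCN][^P]",
}


def scan(sequence, pattern, first_position=1):
    """Return (start, end, match) triples, 1-based and inclusive.

    A zero-width lookahead is used so that overlapping matches are reported.
    """
    hits = []
    for m in re.finditer(f"(?=({pattern}))", sequence):
        start = first_position + m.start()
        hits.append((start, start + len(m.group(1)) - 1, m.group(1)))
    return hits


# Residues 3-30 of 101P3A11, as spelled out by the tabulated peptides.
FRAGMENT = "DPNGNESSATYFILIGLPGLEEAQFWLA"

for name, pattern in PATTERNS.items():
    print(name, scan(FRAGMENT, pattern, first_position=3))
```

Within this fragment the scan reports NESS at 7-10 for N-glycosylation, and GNESSA at 6-11 and GLEEAQ at 21-26 for N-myristoylation, in agreement with the corresponding entries listed above.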
Table XX: Frequently Occurring Motifs

Name

avrg. % identity

Description

Potential Function

zf-C2H2 

34% 

Zinc finger, C2H2 type 

Nucleic acid-binding protein functions as
transcription factor, nuclear location 
probable 

cytochrome_b_N

68%

Cytochrome b(N-terminal)/b6/petB

membrane bound oxidase, generate
superoxide


ig

19%

Immunoglobulin domain 

domains are one hundred amino acids long 
and include a conserved intradomain 
disulfide bond. 

WD40 

18% 

WD domain, G-beta repeat 

tandem repeats of about 40 residues, each
containing a Trp-Asp motif. Function in 
signal transduction and protein interaction 

PDZ 

23% 

PDZ domain 

may function in targeting signaling 
molecules to sub-membranous sites

LRR 

28% 

Leucine Rich Repeat

short sequence motifs involved in protein-protein
interactions

pkinase 

23% 

Protein kinase domain 

conserved catalytic core common to both
serine/threonine and tyrosine protein
kinases containing an ATP binding site and
a catalytic site

PH 

16% 

PH domain 

pleckstrin homology involved in
intracellular signaling or as constituents of
the cytoskeleton

EGF 

34% 

EGF-like domain 

30-40 amino-acid long found in the
extracellular domain of membrane-bound
proteins or in secreted proteins

rvt 

49% 

Reverse transcriptase

(RNA-dependent DNA 
polymerase) 


ank 

25% 

Ank repeat 

Cytoplasmic protein, associates integral 
membrane proteins to the cytoskeleton 

oxidored_q1

32%

NADH-Ubiquinone/plastoquinone
(complex I), various chains

membrane associated. Involved in proton 
translocation across the membrane

efhand 

24% 

EF hand 

calcium-binding domain, consists of a 12
residue loop flanked on both sides by a 12
residue alpha-helical domain

rvp 

79% 

Retroviral aspartyl protease 

Aspartyl or acid proteases, centered on a 
catalytic aspartyl residue 

Collagen 

42% 

Collagen triple helix repeat 
(20 copies) 

extracellular structural proteins involved in 
formation of connective tissue. The 
sequence consists of G-X-Y repeats and the
polypeptide chains form a triple helix.

fn3

20% 

Fibronectin type III domain 

Located in the extracellular ligand-binding 
region of receptors and is about 200 amino 
acid residues long with two pairs of 
cysteines involved in disulfide bonds 

7tm_1

19% 

7 transmembrane receptor 
(rhodopsin family) 

seven hydrophobic transmembrane regions, 
with the N-terminus located extracellularly 
while the C-terminus is cytoplasmic. 
Signal through G proteins

Table XXI: Nucleotide sequence of the splice variant. (SEQ ID NO: 1401)

  1 CACATTCCTT CCATACGGTT GAGCCTCTAC CTGCCTGGTG CTGGTCACAG TTCAGCTTCT
 61 TCATGATGGT GGATCCCAAT GGCAATGAAT CCAGTGCTAC ATACTTCATC CTAATAGGCC
121 TCCCTGGTTT AGAAGAGGCT CAGTTCTGGT TGGCCTCCCA TTGTGCTCCC TCTANCTATG
181 CTGTGCTAGT AATTGACAAT CATCTACATG TGCGGACGAG CACGNCGCNG AGCCCNGTAT
241 NATTCTGCNG CTTCAGCATG ACACCCTNCA GTCTCAGCCA AAGNGCATCT CNGTCAATCA
301 NACACNTGAG CTGTCGTACG AGTTGCATCA TCCTANGGCA GGATCAATGT GCGGNAGGCN
361 TGACGCAGTG CACGTACCAT GGCAGCAAGA CAGGGCCGGT ACAAATGGGG GCGAGNCGGG
421 GTGAAGATGN ACCCTCGGGT CANAGAGTGC CTCTGCGCCA AAACCTCCAT CATGNNAACA
481 GNGTATAACG GCGNAGAATC GGNNANGCGC AAGGCTAAGG AAANNCCCAA NNCNGGTACT
541 TTAACCCNGC AAANGGCANC NAAACGGGNG GGTNANTGAA CAAGGAAGGN NTGNAACTGG
601 GCCAAAACGG GNTGGGCAAN NNAAGGACTC ATGGGNCCAA GGGACGGNAA AAGGGGNAAN
661 CGGGGCGAAA TGNNAAAAAC CGGGNCCCGG GGAANAANGA AGGGGAANAN GNGTGAAGGA
721 CNGGGTTCAA GGGAAAAGNA AAACCANGGG NNAGAAACCN TTCNAANGGC CCGGGNANGA
781 AAGGAANTNN GNNNGGNGAA AAAATCNAAA AAAAGCNGNG GCNNAAAAAN GGGGGGAANN
841 NAAANACCNN GGNCGNNAAA AAACNNAANG NGGGGGGANT ANACACGGAA ANNNANGGGC
901 GNNNAAGGGA AATAANNCGG GAACNAAAGN GCAAACCGNA CGGNAGGAAC GAAACCCACC
961 GGAGNCGCNN AACGCCNNNC NNANCCCGAG CNGAGGTNG

Table XXII: Nucleotide sequence alignment of 101P3A11 (SEQ ID NO: 1402) with the
splice variant (SEQ ID NO: 1403).

Score = 337 bits (175), Expect = 4e-89
Identities = 215/223 (96%), Gaps = 6/223 (2%)
Strand = Plus / Plus

101P3A11: 68   cacattccttccatacggttgagcctctacctgcctggtgctggtcacagttcagcttct 127
               ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Variant : 1    cacattccttccatacggttgagcctctacctgcctggtgctggtcacagttcagcttct 60

101P3A11: 128  tcatgatggtggatcccaatggcaatgaatccagtgctacatacttcatcctaataggcc 187
               ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Variant : 61   tcatgatggtggatcccaatggcaatgaatccagtgctacatacttcatcctaataggcc 120

101P3A11: 188  tccctggtttagaagaggctcagttctggttggccttcccattgtgctccctctacctta 247
               ||||||||||||||||||||||||||||||||||| ||||||||||||||||||| ||
Variant : 121  tccctggtttagaagaggctcagttctggttggcc-tcccattgtgctccctctanct-- 177

101P3A11: 248  ttgctgtgctaggtaacttgacaatcatctacattgtgcggac 290
                |||||||||| |||| |||||||||||||||| |||||||||
Variant : 178  atgctgtgcta-gtaa-ttgacaatcatctaca-tgtgcggac 217
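
The identity and gap figures in the header of such an alignment are simple column counts. As a hedged illustration, the sketch below counts identical, mismatched and gapped columns for the last block of the alignment (101P3A11 positions 248-290 against variant positions 178-217); the function name `alignment_stats` is hypothetical, and the whole-alignment percentages are recomputed from the counts stated in the header rather than from the full sequences.

```python
def alignment_stats(query, subject, gap="-"):
    """Count identities, mismatches and gap columns in a pairwise alignment.

    Both strings must be the aligned rows, gaps included, of equal length.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identities = mismatches = gaps = 0
    for q, s in zip(query, subject):
        if q == gap or s == gap:
            gaps += 1
        elif q == s:
            identities += 1
        else:
            mismatches += 1
    return identities, mismatches, gaps


# Last block of the alignment (101P3A11 248-290 against variant 178-217).
QUERY = "ttgctgtgctaggtaacttgacaatcatctacattgtgcggac"
VARIANT = "atgctgtgcta-gtaa-ttgacaatcatctaca-tgtgcggac"

identities, mismatches, gaps = alignment_stats(QUERY, VARIANT)
print(identities, mismatches, gaps)  # 39 1 3

# Whole-alignment figures from the counts given in the header.
print(f"{100 * 215 / 223:.1f}% identity, {100 * 6 / 223:.1f}% gaps")
```

The second line prints 96.4% identity and 2.7% gaps, which the header shows as the whole percentages 96% and 2%.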


Table XXIII: Longest single amino acid sequence alignment of 101P3A11 (SEQ ID
NO: 1404) and the splice variant (SEQ ID NO: 1405).

Score = 134 bits (287), Expect(2) = 3e-29
Identities = 51/51 (100%)
Frame = +1 / +3

101P3A11: 70   HSFHTVEPLPAWCWSQFSFFMMVDPNGNESSATYFILIGLPGLEEAQFWLA 222
               HSFHTVEPLPAWCWSQFSFFMMVDPNGNESSATYFILIGLPGLEEAQFWLA
Variant : 3    HSFHTVEPLPAWCWSQFSFFMMVDPNGNESSATYFILIGLPGLEEAQFWLA 155


Table XXIV: Peptide sequences from the translation of the nucleotide sequence of
the splice variant.

Open reading frame

Amino acid sequences

Frame 1

SEQ ID NO: 1406

HIPSIRLSLYLPGAGHSSASS*WWIPMAMNPVLHTSS**ASLV*KRLSSGWPPIVLPLXM
LC**LTIIYMCGRARRXAXYXSAASA*HPXVSAKXHLXQSXT*AVVRVASSXGRINVRXA
*RSARTMAARQGRYKWGRXGVKMXPRVXECLCAKTSIMXTXYNGXESXXRKAKEXPXXGT
LTXQXAXKRXGX*TRKXXNWAKTGWAXXGLMGPRDGKRGXRGEMXKTGXRGXXKGXXXEG
XGSREKXNXGXETXXXARXXKEXXXXKKSKKSXGXKXGGXXXPXXXKNXXXGXXTRKXXG
XXGK*XGNXXANRTXGTKPTGXAXRXXXPEXRX

Frame 2

SEQ ID NO: 1407

TFLPYG*ASTCLVLVTVQLLHDGGSQWQ*IQCYILHPNRPPWFRRGSVLVGLPLCSLXLC
CASN*QSSTCADEHXAEPXXILXLQHDTLQSQPKXISVNXTXELSYELHHPXAGSMCGRX
DAVHVPWQQDRAGTNGGEXG*RXTLGSXSASAPKPPSXXQXITAXNRXXARLRKXPXXVL
*PXKXXXNGXVXEQGRXXTGPKRXGQXKDSWXQGTXKGXXGAKXXKPGPGEXXRGXXVKD
XVQGKXKTXGXKPFXXPGXERXXXXXKNXKKAXAXKXGEXKXXGRXKTXXGGXXHGXXXA
XKGNXXGTKXQTXRXERNPPEXXNAXXXPSXG

Frame 3

HSFHTVEPLPAWCWSQFSFFMMVDPNGNESSATYFILIGLPGLEEAQFWLASHCAPSXYA
VLVIDNHLHVRTSTXXSPVXFCXFSMTPXSLSQXASXSIXHXSCRTSCIILXQDQCAXGX
TQCTYHGSKTGPVQMGAXRGEDXPSGXRVPLRQNLHHXNXV*RRRIGXAQG*GXXQXXYF
NPAXGXXTGGXXNKEGXXLGQNGXGXXRTHGXKGRXKGXXGRNXKNRXPGXXEGEXX*RT
GFKGKXKPXXRNXSXGPGXKGXXXGEKIXKKXXXKXGGXXXTXXXKKXXXGGXXTEXXGR
XREIXREXKXKPXGRNETHRXRXTPXXXRAEV


Note: Frame 3 gives the longest subsequence that is identical with the 101P3A11 amino acid
sequence. In this Table each '*' indicates the product of a single stop codon, and
'X' indicates a single unknown amino acid.
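
A three-frame translation of this kind can be sketched as follows. This is an illustration under stated assumptions, not the program that generated the table: the standard genetic code is assumed, the names `translate_codon` and `translate` are hypothetical, and a codon containing N is reported as 'X' unless every possible reading encodes the same residue, a rule that appears consistent with the frames tabulated above. Only the first 60 nucleotides of the splice variant (Table XXI) are translated.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated with T, C, A, G at each position.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_codon(codon):
    """Translate one codon; 'N' bases give 'X' unless every reading agrees."""
    options = [BASES if base == "N" else base for base in codon]
    residues = {CODON_TABLE["".join(c)] for c in product(*options)}
    return residues.pop() if len(residues) == 1 else "X"


def translate(dna, frame):
    """Translate forward reading frame 1, 2 or 3 of an upper-case DNA string."""
    start = frame - 1
    usable = len(dna) - (len(dna) - start) % 3  # drop the trailing partial codon
    return "".join(translate_codon(dna[i:i + 3]) for i in range(start, usable, 3))


# First 60 nucleotides of the splice variant (Table XXI).
DNA = "CACATTCCTTCCATACGGTTGAGCCTCTACCTGCCTGGTGCTGGTCACAGTTCAGCTTCT"

for frame in (1, 2, 3):
    print(frame, translate(DNA, frame))
```

The three printed strings are the beginnings of Frame 1, Frame 2 and Frame 3 as listed in the table, including the stop codon marked '*' in Frame 2.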
TABLE XXV: Properties of 101P3A11

Property | Bioinformatic Program | URL | Outcome
ORF | ORF Finder | http://www.ncbi.nlm.gov/gorf | 133-1086 (includes stop)
Protein Length | n/a | n/a | 317 amino acids
Transmembrane region | TM Pred | http://www.ch.embnet.org/ | 7 TM at aa: 27-52, 63-88, 104-129, 146-165, 196-224, 239-262, 273-295
Transmembrane region | HMMTop | http://www.enzim.hu/hmmtop/ | 7 TM at aa: 27-50, 63-86, 99-121, 146-165, 201-224, 239-262, 275-294
Transmembrane region | Sosui | http://www.genome.ad.jp/SOSui/ | 7 TM, at aa: 29-51, 63-85, 100-122, 203-225, 239-261, 273-295
Transmembrane region | TMHMM | http://www.cbs.dtu.dk/services/TMHMM | 7 TM, at aa: 29-51, 63-85, 100-122, 143-165, 202-224, 236-258, 273-295
Signal Peptide | Signal P | http://www.cbs.dtu.dk/services/SignalP/ | indicates no signal
pI | pI/MW tool | http://www.expasy.ch/tools/ | pI 8.7
Molecular weight | pI/MW tool | http://www.expasy.ch/tools/ | 35.2 kDa
Localization | PSORT | http://psort.nibb.ac.jp/ | Plasma membrane 64%
Localization | PSORT II | http://psort.nibb.ac.jp/ | Plasma membrane 56.4%
Motifs | Pfam | http://www.sanger.ac.uk/Pfam/ | 7 transmembrane receptor (rhodopsin family)
Motifs | Prints | http://www.biochem.ucl.ac.uk/ | Rhodopsin-like GPCR superfamily
Motifs | Blocks | http://www.blocks.fhcrc.org/ | Rhodopsin-like GPCR superfamily
Motifs | Prosite | http://www.genome.ad.jp/ | G-protein coupled receptors family 1

L

R

L

F 

H 

V 

14 

1896 

27 

130 

H 

P 

L 

R 

H 

A 

T 

V 

L

16 

1947 

78 

5 

N 

G 

N 

E 

S 

S 

A 

T 

Y 

13 

1897 

28 

141 

P 

R 

V 

T 

K 

I 

G 

V 

A 

16 

1948 

79 

23 

E 

E 

A 

Q 

F 

w 

L 

A 

F 

13 

1898 

29 

153 

R 

G 

A 

A 

L 

M 

A 

p 

L

16 

1949 

80 

28 

W 

L

A 

F 

P 

L 

C 

S 

L 

13 

1899 

30 

181 

C 

L 

H 

Q 

D 

V 

M

K 

L

16 

1950 

81 

44 

N 

L

T 

I 

I 

Y 

I 

V 

R 

13 

1900 

31 

201 

G 

L

I 

V 

I 

I 

S 

A 

I 

16 

1951 

82 

51 

V 

R 

T 

E 

H 

S 

L 

H 

E 

13 

1901 

32 

203 

I 

V 

I 

I 

s 

A 

I 

G 

L 

16 

1952 

83 

56 

s 

L 

H 

E 

P 

M 

Y 

I 

F 

13 

1902 

33 

216 

I 

S 

F 

s 

Y 

L 

L 

I 

L 

16 

1953 

84 

60 

P 

M 

Y 

I 

F 

L 

c 

M 

L

13 

1903 

34 

222 

L 

I 

L 

K 

T 

V 

L 

G 

L

16 

1954 

85 

72 

D 

I 

L

I 

S 

T 

S 

S 

M 

13 

1904 

35 

255 

P 

F 

I 

G 

L

s 

M 

V 

H 

16 

1955 

86 

74 

L 

I 

S 

T 

s 

s 

M 

P 

K 

13 

1905 

36 

257 

I 

G 

L 

S 

M 

V 

H 

R 

F 

16 

1956 

87 

75 

I 

S 

T 

S 

S 

M 

P 

K 

M 

13 

1906 

37 

275 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

16 

1957 

88 

98 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

13 

1907 

38 

47 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

15 

1958 

89 

104 

F 

A 

I 

H 

S 

L 

S 

G 

M 

13 

1908 

39 

109 

L 

S 

G 

M 

E 

S 

T 

V 

L 

15 

1959 

90 

110 

S 

G 

M 

E 

S 

T 

V 

L

L

13 

1909 

40 

114 

S 

T 

V 

L 

L 

A 

M 

A 

F 

15 

1960 

91 

116 

V 

L 

L 

A 

M 

A 

F 

D 

R 

13 

1910 

41 

123 

D 

R 

Y 

V 

A 

I 

C 

H 

P 

15 

1961 

92 

126 

V 

A 

I 

C 

H 

P 

L

R 

H 

13 

1911 

42 

145 

K 

I 

G 

V 

A 

A 

V 

V 

R 

15 

1962 

93 

149 

A 

A 

V 

V 

R 

G 

A 

A 

L

13 

1912 

43 

156 

A 

L 

M 

A 

P 

L

p 

V 

F 

15 

1963 

94 

158 

M 

A 

P 

L 

P 

V 

F 

I 

K 

13 

1913 

44 

168 

L 

P 

F 

C 

R 

s 

N 

I 

L 

15 

1964 

95 

164 

F 

I 

K 

Q 

L 

P 

F 

C 

R 

13 

1914 

45 

172 

R 

S 

N 

I 

L 

S 

H 

S 

Y 

15 

1965 

96 

170 

F 

C 

R 

S 

N 

I 

L 

S 

H 

13 

1915 

46 

198 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

15 

1966 

97 

171 

C 

R 

S 

N 

I 

L 

S 

H 

S 

13 

1916 

47 

206 

I 

s 

A 

I 

G 

L 

D 

s 

L

15 

1967 

98 

187 

M 

K 

L 

A 

C 

D 

D 

I 

R 

13 

1917 

48 

229 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

15 

1968 

99 

217 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

13 

1918 

49 

248 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

15 

1969 

100 

224 

L

K 

T 

V 

L 

G 

L 

T 

R 

13 

1919 

50 

251 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

15 

1970 

101 

242 

C 

V 

S 

H 

V 

C 

A 

V 

F 

13 

1920 

51 

274 

P 

V 

I 

L 

A 

N 

I 

Y 

L

15 

1971 

102 

256 

F 

I 

G 

L 

S 

M 

V 

H 

R 

13 

1921 

52 

290 

P 

I 

V 

Y 

G 

V 

K 

T 

K 

15 

1972 

103 

261 

M 

V 

H 

R 

F 

S 

K 

R 

R 

13 

1922 

53 

298 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

15 

1973 

104 

49 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

12 

1923 

54 

19 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

14 

1974 

105 

57 

L 

H 

E 

P 

M 

Y 

I 

F 

L 

12 

1924 

55 

29 

L 

A 

F 

P 

L 

C 

S 

L 

Y 

14 

1975 

106 

88 

W 

F 

N 

S 

T 

T 

I 

Q 

F 

12 

1925 

56 

30 

A 

F 

P 

L 

C 

S 

L 

Y 

L

14 

1976 

107 

96 

F 

D 

A 

C 

L 

L 

Q 

I 

F 

12 

1926 

57 

39 

I 

A 

V 

L 

G 

N 

L 

T 

I 

14 

1977 

108 

134 

H 

A 

T 

V 

L 

T 

L 

P 

R 

12 

1927 

58 

40 

A 

V 

L 

G 

N 

L 

T 

I 

I 

14 

1978 

109 

152 

V 

R 

G 

A 

A 

L 

M 

A 

P 

12 

sd-63712

SEQ
ID

NO: HLA-B*2705 nonamers

Pos score

1

2

3

4

5

6

7

8

9

1979

110

179

S

Y

C

L

H

Q

D

V

M

12

1980

111

197

N

V

V

Y

G

L

I

V

I

12

1981 

112 

244 

s 

H 

V 

c 

A 

V 

F 

I 

F 

12 

1982 

113 

265 

F 

S 

K 

R 

R 

D 

S 

p 

L 

12 

1983 

114 

273 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

12 

1984 

115 

285 

P 

P 

V 

L

N 

P 

I 

V 

Y 

12 

1985 

116 

288 

L 

N 

P 

I 

V 

Y 

G 

V 

K 

12 

1986 

117 

296 

K 

T 

K 

E 

I 

R 

Q 

R 

I 

12 

1987 

118 

297 

T 

K 

E 

I 

R 

Q 

R 

I 

L 

12 


SEQ 
ID 

NO: HLA-B*2709 nonamers

Pos score 





1 

2 

3 

4 

5 

6 

7 

8 

9 


1988 

1 

194 

I 

R 

v 

N 

V 

V 

Y 

G 

L 

24 

1989 

2 

268 

R 

R

D

S

P 

L 

P 

V 

I 

24 

1990 

3 

132 

L 

R 

H 

A 

T 

v 

L 

T 

L 

22 

1991 

4 

267 

K 

R 

R 

D 

s 

p 

L 

p 

v 

21 

1992 

5 

300 

I 

R 

Q

R 

I 

L 

R 

L 

F 

20 

1993 

6 

231 

T 

R 

E 

A 

Q 

A 

K 

A 

F 

19 

1994 

7 

302 

Q

R 

I 

L 

R 

L 

F 

H 

v 

19 

1995 

8 

124 

R 

Y 

V 

A 

I 

c 

H 

p 

L 

16 

1996 

9 

269 

R 

D 

s 

P 

L 

p 

v 

I 

L 

16 

1997 

10 

43 

G 

N 

L 

T 

I 

I 

Y 

I 

v 

15 

1998 

11 

216 

I 

s 

F 

s 

Y 

L 

L 

I 

L 

15 

1999 

12 

11 

A 

T 

Y 

F 

I 

L

I 

G 

L 

14 

2000 

13 

25 

A 

Q 

F 

W 

L 

A 

F 

P 

L 

14 

2001 

14 

153 

R 

G 

A 

A 

L 

M 

A 

p 

L 

14 

2002 

15 

174 

N 

I 

L 

S 

H 

S 

Y 

C 

L 

14 

2003 

16 

222 

L 

I 

L 

K 

T 

V 

L 

G 

L 

14 

2004 

17 

257 

I 

G 

L 

S 

M 

V 

H 

R 

F 

14 

2005 

18 

280 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

14 

2006 

19 

6 

G 

N 

E 

S 

s 

A 

T 

Y 

F 

13 

2007 

20 

14 

F 

I 

L 

I 

G 

L 

P 

G 

L 

13 

2008 

21 

21 

G 

L 

E 

E 

A 

Q 

F 

W 

L 

13 

2009 

22 

66 

C 

M 

L 

S 

G 

I 

D 

I 

L 

13 

2010 

23 

130 

H 

P 

L 

R 

H 

A 

T 

V 

L 

13 

2011 

24 

201 

G 

L 

I 

V 

I 

I 

S 

A 

I 

13 

2012 

25 

203 

I 

V 

I 

I 

S 

A 

I 

G 

L 

13 

2013 

26 

214 

L 

L 

I 

s 

F 

S 

Y 

L 

L 

13 

2014 

27 

251 

I 

F 

Y 

V 

P 

F 

I 

G 

L

13 

2015 

28 

263 

H 

R 

F 

s 

K 

R 

R 

D 

S 

13 

2016 

29 

275 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

13 

2017 

30 

305 

L 

R 

L 

F 

H 

V 

A 

T 

H 

13 

2018 

31 

30 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

12 

2019 

32 

34 

C 

S 

L 

Y 

L 

I 

A 

V 

L 

12 

2020 

33 

37 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

12 

2021 

34 

51 

V 

R 

T 

E 

H 

S 

L 

H 

E 

12 

2022 

35 

60 

P 

M 

Y 

I 

F 

L 

C 

M 

L 

12 

2023 

36 

75 

I 

S 

T 

S 

S 

M 

P 

K 

M 

12 


2024

37

93

T

I

Q

F

D

A

C

L

L

12

2025

38

123

D

R

Y

V

A

I

C

H

P

12

2026

39

135

A

T

V

L

T

L

P

R

V

12

2027

40

138

L

T

L

P

R

V

T

K

I

12

2028

41

149

A

A

V

V

R

G

A

A

L

12

2029

42

155

A

A

L

M

A

P

L

P

V

12

2030

43

168

L

P

F

C

R

S

N

I

L

12

2031

44

181

C

L

H

Q

D

V

M

K

L

12

2032

45

188

K

L

A

C

D

D

I

R

V

12

2033

46

190

A

C

D

D

I

R

V

N

V

12

2034

47

195

R

V

N

V

V

Y

G

L

I

12

2035 

48 

210 

G 

L 

D 

S 

L 

L 

I 

s 

F 

12 

2036 

49 

213 

S 

L 

L 

I 

S 

F 

s 

Y 

L 

12 

2037 

50 

220 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

12 

2038 

51 

248 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

12 

2039 

52 

279 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

12 

2040 

53 

287 

V 

L 

N 

P 

I 

V 

Y 

G 

V 

12

2041 

54 

296 

K 

T 

K 

E 

I 

R 

Q 

R 

I 

12

2042 

55 

299 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

12

SEQ 
ID 
NO : 

HLA-B*5101 

nonamers







Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

score 

2043 

1 

39 

I 

A 

V 

L 

G 

N 

L 

T 

I 

zo 

2044 

2 

31 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

z b 

2045 

3 

120 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

24

2046 

4 

130 

H 

P 

L 

R 

H 

A 

T 

V 

L 

23

2047 

5 

118 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

22

2048 

6 

140 

L 

P 

R 

V 

T 

K 

I 

G 

V 

22

2049 

7 

155 

A 

A 

L 

M 

A 

P 

L 

P 

V 

22

2050 

8 

42 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

21

2051 

9 

254 

V 

P 

F 

I 

G 

Lt 

S 

M 

V 

21

2052 

10 

284 

V 

P 

P 

V 

L 

N 

P 

I 

V 

21

2053 

11 

168 

L 

P 

F 

c 

R 

S 

N 

I 

L 

20

2054 

12 

235 

Q 

A 

K 

A 

F 

G 

T 

C 

V 

20

2055 

13 

138 

L 

T 

L 

P 

R 

V 

T 

K 

I 

19

2056 

14 

159 

A 

P 

L 

P 

V 

F 

I 

K 

Q 

18

2057 

15 

189 

L 

A 

C 

D 

D 

I 

R 

V 

N

18

2058 

16 

198 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

18

2059 

17 

277 

L 

A 

N 

I 

Y 

L 

L

V

P

18

2060 

18 

207 

S 

A 

I 

G 

L 

D 

S

L

L

17

2061 

19 

283 

L 

V 

P 

P 

V 

L 

N

P

I

17

2062 

20 

63 

I 

F 

L 

C 

M 

L 

S 

G 

I 

16 

2063 

21 

86 

I 

F 

W 

F 

N 

S 

T 

T 

I 

16 

2064 

22 

110 

S 

G 

M 

E 

S 

T 

V 

L 

L 

16 

2065 

23 

144 

T 

K 

I 

G 

V 

A 

A 

V 

V 

16 

2066 

24 

149 

A 

A 

V 

V 

R 

G 

A 

A 

L 

16 

2067 

25 

197 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

16 

2068 

26 

271 

S 

P 

L 

P 

V 

I 

L 

A 

N 

16 

2069 

27 

280 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

16 


SEQ

ID 
NO: 

HLA-B*5101 

nonamers







Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

score 

2070 

28 

3 

D 

P 

N 

G 

N 

E 

S 

S 

A 

15 

2071 

29 

40 

A 

V 

L 

G 

N 

L

T 

I 

I 

15 

2072 

30 

97 

D 

A 

C

L 

L 

Q 

I 

F 

A 

15 

2073 

31 

132 

L 

R 

H 

A 

T 

V 

L 

T 

L 

15 

2074 

32 

222 

L 

I 

L 

K 

T 

V 

L 

G 

L 

15 

2075 

33 

279 

N 

I 

Y 

L 

L

V 

P 

P 

V 

15 

2076 

34 

285 

P 

P 

V 

L 

N 

P 

I 

V 

Y 

15 

2077 

35 

289 

N 

P

I 

V 

Y 

G 

V 

K 

T 

15 

2078 

36 

9 

S

S

A 

T 

Y 

F 

I 

L 

I 

14 

2079 

37 

65 

L 

C

M 

L 

S

G 

I 

D 

I 

14 

2080 

38 

84 

L 

A 

I 

F 

W 

F 

N 

S 

T 

14 

2081 

39 

126 

V 

A 

I 

C 

H 

P 

L 

R 

H 

14 

2082 

40 

157 

L 

M 

A 

P 

L

P 

V 

F 

I 

14 

2083 

41 

158 

M 

A 

P 

L 

P 

V 

F 

I 

K 

14 

2084 

42 

191 

C

D 

D 

I 

R 

V 

N 

V 

V 

14 

2085 

43 

200 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

14 

2086 

44 

209 

I 

G 

L 

D 

S

L 

L 

I 

S 

14 

2087 

45 

215 

L 

I 

S

F 

S

Y 

L 

L 

I 

14 

2088 

46 

219 

S

Y 

L 

L 

I 

L 

K 

T 

V 

14 

2089 

47 

220 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

14 

2090 

48 

237 

K 

A 

F 

G 

T 

C

V 

S 

H 

14 

2091 

49 

247 

C 

A 

V 

F 

I 

F 

Y 

V 

P 

14 

2092 

50 

249 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

14 

2093 

51 

251 

I 

F 

Y 

V 

P

F 

I 

G 

L 

14 

2094 

52 

257 

I 

G 

L 

S

M 

V 

H 

R 

F 

14 

2095 

53 

268 

R 

R 

D 

S

P

L 

P 

V 

I 

14 

2096 

54 

273 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

14 

2097 

55 

29 

L 

A 

F 

P 

L 

C

S 

L 

Y 

13 

2098 

56 

33 

L 

C

S

L 

Y 

L 

I 

A 

V 

13 

2099 

57 

55 

H 

S

L 

H 

E 

P

M 

Y 

I 

13 

2100 

58 

67 

M 

L 

S

G 

I 

D 

I 

L 

I 

13 

2101 

59 

80 

M 

P 

K 

M 

L 

A 

I 

F 

W 

13 

2102 

60 

95 

Q 

F 

D 

A 

C 

L 

L 

Q 

I 

13 

2103 

61 

98 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

13 

2104 

62 

104 

F 

A 

I 

H 

S

L 

S 

G 

M 

13 

2105 

63 

146 

I 

G 

V 

A 

A 

V 

V 

R 

G 

13 

2106 

64 

148 

V 

A 

A 

V 

V 

R 

G 

A 

A 

13 

2107 

65 

153 

R 

G 

A 

A 

L 

M 

A 

P 

L 

13 

2108 

66 

233 

E 

A 

Q 

A 

K 

A 

F 

G 

T 

13 

2109 

67 

243 

V 

S 

H 

V 

C 

A 

V 

F 

I 

13 

2110 

68 

292 

V 

Y 

G 

V 

K 

T 

K 

E 

I 

13 

2111 

69 

296 

K 

T 

K 

E 

I 

R 

Q 

R 

I 

13 


sd-63712
Table XXVII:

HLA Class I decamers 


SEQ 
ID 

NO: HLA-A1 decamers




Pos 

1 

2 

3 

4 

5

6

7

8 

9 

0

score 

2112 

1 

191 

C

D

D

I

R

V

N

V

V 

Y 

27 

2113 

2 

244 


S

H

V

C

A

V

F

I

F

Y

24 

2114 

3 

40 


A

V

L

G

N

L

T

I

I

Y

21

2115 

4 

284 

V

P

P

V

L

N

P

I

V

Y

21

2116 

5

116

V

L

L

A

M

A

F

D

R

Y

20

2117 

6

28

W

L

A

F

P

L

C

S

L

Y

18

2118 

7 

297 

T

K

E

I

R

Q

R

I

L

R

17 

2119 

8 

21

G

L

E

E

A

Q

F

W

L

A

16

2120 

9

22

L

E

E

A

Q

F

W

L

A

F

16

2121 

10 

52

R

T

E

H

S

L

H

E

P

M

16

2122 

11

53

T

E

H

S

L

H

E

P

M

Y

16

2123 

12

57

L

H

E

P

M

Y

I

F

L

C

16

2124 

13 

111 

G 

M 

E 

s 

T 

V 

L 

L 

A 

M 

16 

2125 

14 

272 

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

16 

2126 

15 

1 

M 

V 

D 

P 

N 

G 

N 

E 

S 

S 

15 

2127 

16 

4 

P 

N 

G 

N 

E 

S 

S 

A 

T 

Y 

15 

2128 

17 

121 

A 

F 

D 

R 

Y 

V 


A

I

C

H

15


2129 

18 

171 

C 

R 

S 

N 

I 

L 


S

H

S

Y

15

2130 

19 

211 

L 

D 

S 

L

L

I

S

F

S

Y

15

2131 

20 

8 

E 

S 

S 

A 

T 

Y 

F

I

L

I

X J 

2132 

21 

190 

A 

C 

D 

D 

I 

R 

V

N

V 

V 

X J 

SEQ 
ID 

NO:

HLA-A*0201 decamers 








Pos 

1 

2 

3 

4 

5 

6 

7

8

9

0

score 

2133 

1 

221 

L 

L 

I 

L 

K 

T 

V 

L

G

L

J u 

2134 

2 

100 

L 

L 

Q 

I 

F 

A 

I

H

S

L

9 Q 

2135 

3 

282 

L 

L

V

P

P

V

L

N

P

I

9 9 

2136 

4 

205 

I 

I 

S 

A 

I 

G 

L

D

S

L

9 & 
Z O 

2137 

5 

213 

S 

L 

L 

I 

S 

F 

S

Y

L

L

9 

Z D 

2138 

6 

56 

S

L 

H 

E 

P 

M 

Y

I

F

L


2139 

7 

62 

Y 

I 

F 

L 

C 

M 

L

S

G

I

24

2140 

8 

108 

S 

L 

S 

G 

M 

E 

S

T

V

L

24

2141 

9 

117 

L 

L 

A 

M 

A 

F 

D

R

Y

V

24

2142 

10 

131 

P 

L 

R 

H 

A 

T 

V 

L

T

L

24

2143 

11 

137 

V 

L 

T 

L 

P 

R 


V

T

K

I

24

2144 

12 

215 

L 

I 

S 

F 

S 

Y 

L 

L 

I

L

24

2145 

13 

38 

L 

I 

A 

V 

L 

G 

N

L

T

I

23

2146 

14 

41 

V 

L 

G 

N 

L 

T 


I

I

Y

I

23

2147 

15 

156 

A 

L 

M 

A 

P 

L 

P

V

F

I

23

2148 

16 

193 

D 

I 

R 

V 

N 

V 

V

Y 

G 

L 

23 

2149 

17 

214 

L 

L 

I 

S 

F 

S 

Y 

L 

L 

I 

23 

2150 

18 

32 

P 

L

C 

S 

L 

Y 

L 

I 

A 

V 

22 

2151 

19 

119 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

22 

2152 

20 

237 

K 

A 

F 

G 

T 

C 

V 

S

H 

V 

22 

2153 

21 

275 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

22 

2154 

22 

85 

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

21 

2155 

23 

139 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

21 

2156 

24 

202 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

21 


SEQ 

ID 

NO: 

HLA-A*0201 decamers 








Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

score 

2157 

25 

13 

Y 

F 

I 

L 

I 

G 

L 

P 

G 

L 

20 

2158 

26 

16 

L 

I 

G 

L

P 

G 

L 

E 

E 

A 

20 

2159 

27 

29 

L 

A 

F 

P 

L 

C 

S 

L

Y 

L

20 

2160 

28 

142 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

20 

2161 

29 

148 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

20 

2162 

30 

167 

Q 

L 

P 

F 

C

R 

S 

N 

I 

L 

20 

2163 

31 

180 

Y 

C 

L 

H 

Q 

D 

V 

M 

K 

L 

20 

2164 

32 

222 

L

I 

L 

K 

T 

V

L 

G 

L 

T 

20 

2165 

33 

240 

G 

T 

C

V 

S 

H 

V 

C 

A 

V 

20 

2166 

34 

248 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

20 

2167 

35 

250 

F 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

20 

2168 

36 

271 

S 

P 

L 

P 

V 

I 

L 

A 

N 

I 

20 

2169 

37 

279 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L

20 

2170 

38 

304 

I 

L

R 

L 

F 

H 

V 

A 

T 

H 

20 

2171 

39 

10 

S 

A 

T 

Y 

F 

I

L 

I 

G 

L 

19 

2172 

40 

15 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

19 

2173 

41 

27 

F 

W 

L 

A 

F 

P 

L 

C 

S 

L 

19 

2174 

42 

35 

S 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

19 

2175 

43 

37 

Y 

L

I 

A 

V 

L 

G 

N 

L 

T 

19 

2176 

44 

44 

N 

L

T 

I 

I 

Y 

I 

V 

R 

T 

19 

2177 

45 

64 

F 

L 

C 

M 

L 

S 

G 

I 

D 

I 

19 

2178 

46 

83 

M 

L 

A 

I 

F 

W 

F 

N 

S 

T 

19 

2179 

47 

159 

A 

P 

L

P 

V 

F 

I 

K 

Q 

L 

19 

2180 

48 

189 

L 

A 

C 

D 

D 

I 

R 

V 

N 

V 

19 

2181 

49 

207 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

19 

2182 

50 

253 

Y 

V 

P 

F 

I 

G 

L 

S 

M 

V 

19 

2183 

51 

276 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P

19 

2184 

52 

281 

Y 

L

L 

V 

P 

P 

V 

L 

N 

P

19 

2185 

53 

283 

L 

V 

P 

P 

V 

L 

N 

P 

I 

V 

19 

2186 

54 

286 

P 

V 

L 

N 

P 

I 

V 

Y 

G 

V 

19 

2187 

55 

33 

L 

C 

S 

L 

Y 

L 

I 

A 

V 

L 

18 

2188 

56 

36 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

18 

2189 

57 

39 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

18 

2190 

58 

42 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

V 

18 

2191 

59 

66 

C 

M 

L 

S 

G 

I 

D 

I 

L 

I 

18 

2192 

60 

111 

G 

M 

E 

S 

T 

V 

L 

L 

A 

M 

18 

2193 

61 

128 

I 

C 

H 

P 

L 

R 

H 

A 

T 

V 

18 

2194 

62 

134 

H 

A 

T 

V 

L 

T 

L 

P 

R 

V 

18 

2195 

63 

154 

G 

A 

A 

L 

M 

A 

P 

L 

P 

V 

18 

2196 

64 

157 

L 

M 

A 

P 

L 

P 

V 

F 

I 

K 

18 

2197 

65 

190 

A 

C 

D 

D 

I 

R 

V 

N 

V 

V 

18 

2198 

66 

229 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

18 

2199 

67 

245 

H 

V 

C 

A 

V 

F 

I 

F 

Y 

V 

18 

2200 

68 

274 

P 

V 

I 

L 

A 

N 

I 

Y 

L

L 

18 

2201 

69 

278 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

18 

2202 

70 

291 

I 

V 

Y 

G 

V 

K 

T 

K 

E 

I 

18 

2203 

71 

298 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

18 

2204 

72 

48 

I 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

17 

2205 

73 

65 

L 

C 

M 

L 

S 

G 

I 

D 

I 

L 

17 

2206 

74 

67 

M 

L

S 

G 

I 

D 

I 

L 

I 

S 

17 

2207 

75 

74 

L 

I 

S 

T 

S 

S 

M 

P 

K 

M 

17 

2208 

76 

91 

S 

T 

T 

I 

Q 

F 

D 

A 

C 

L 

17

SEQ
ID 

NO: HLA-A*0201 decamers




Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

score 

2209 

77 

94 

I 

Q 

F 

D 

A 

C 

L 

L 

Q 

I 

17 

2210 

78 

188 

K 

L 

A 

C 

D 

D 

I 

R 

V 

N 

17 

2211 

79 

197 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

17 

2212 

80 

200 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

17 

2213 

81 

218 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

17 

2214 

82 

227 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

17 

2215 

83 

303 

R 

I 

L 

R 

L 

F 

H 

V 

A 

T 

17 

2216 

84 

21 

G 

L

E 

E 

A 

Q 

F 

W 

L 

A 

16 

2217 

85 

92 

T 

T 

I 

Q 

F 

D 

A 

C

L 

L

16 

2218 

86 

97 

D 

A 

C 

L

L 

Q 

I 

F 

A 

I 

16 

2219 

87 

127 

A 

I 

C 

H 

P 

L 

R 

H 

A 

T 

16 

2220 

88 

143 

V 

T 

K 

I 

G 

V 

A 

A 

V 

V 

16 

2221 

89 

195 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

16 

2222 

90 

220 

Y 

L 

L 

I 

L

K 

T 

V 

L 

G 

16 

2223 

91 

296 

K 

T 

K 

E 

I 

R 

Q 

R 

I 

L 

16 

2224 

92 

18 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

15 

2225 

93 

30 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

15 

2226 

94 

126 

V 

A 

I 

C 

H 

P 

L

R 

H 

A 

15 

2227 

95 

145 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

15 

2228 

96 

173 

S 

N 

I 

L 

S 

H 

S 

Y 

C 

L 

15 

2229 

97 

201 

G 

L 

I 

V 

I 

I 

S

A 

I 

G 

15 

2230 

98 

208 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

15 

2231 

99 

210 

G 

L 

D 

S 

L 

L 

I 

S 

F 

S 

15 

2232 

100 

267 

K 

R 

R 

D 

S 

P 

L 

P 

V 

I 

15 

SEQ 

ID

NO:

HLA-A*0203 decamers






Pos

1

2

3

4

5

6

7

8

9

0

score

2233

1

141 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

19 

2234 

2 

147 

G 

V 

A 

A 

V 

V 

R 

G 

A 

A 

19 

2235 

3 

112 

M 

E 

S 

T 

V 

L 

L 

A 

M 

A 

18 

2236 

4 

227 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

18 

2237 

5 

229 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

18 

2238 

6 

142 

R 

V

T 

K 

I 

G 

V 

A 

A 

V 

17 

2239 

7 

148 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

17 

2240 

8 

2 

V 

D 

P 

N 

G 

N 

E 

S 

S 

A 

10 

2241 

9 

16 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

10 

2242 

10 

21 

G 

L 

E 

E 

A 

Q 

F 

W 

L 

A 

10 

2243 

11 

31 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

A 

10 

2244 

12 

76 

S 

T 

S 

S 

M 

P 

K 

M 

L 

A 

10 

2245 

13 

89 

F 

N 

S 

T 

T 

I 

Q 

F 

D 

A 

10 

2246 

14 

96 

F 

D 

A 

C 

L 

L 

Q 

I 

F 

A 

10 

2247 

15 

110 

S 

G 

M 

E 

S 

T 

V 

L 

L 

A 

10 

2248 

16 

118 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

10 

2249 

17 

126 

V 

A 

I 

C 

H 

P 

L 

R 

H 

A 

10 

2250 

18 

140 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

10 

2251 

19 

146 

I 

G 

V 

A 

A 

V 

V

R 

G 

A 

10 

2252 

20 

150 

A 

V

V 

R 

G 

A 

A 

L

M 

A 

10 

2253 

21 

181 

C 

L 

H 

Q 

D 

V 

M 

K 

L 

A 

10 

2254 

22 

199 

V 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

10 

2255

23

225

K

T

V

L

G

L

T

R

E

A

10

2256

24

239

F

G

T

C

V

S

H

V

C

A

10




SEQ 
ID 

NO: HLA-A*0203 decamers




Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

score 

2257 

25 

269 

R 

D 

S 

P 

L 

P 

V

I 

L 

A 

10 

2258 

26 

302 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

10 

2259 

27 

305 

L 

R 

L 

F 

H 

V 

A 

T 

H 

A 

10 

2260 

28 

3 

D 

P 

N 

G 

N 

E 

S 

S 

A 

T 

9 

2261 

29 

17 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

9 

2262 

30 

22 

L 

E 

E 

A 

Q 

F 

W 

L 

A 

F 

9 

2263 

31 

32 

P 

L 

C 

S 

L

Y 

L 

I 

A 

V 

9 

2264 

32 

77 

T 

S 

S 

M 

P 

K 

M 

L 

A 

I 

9 

2265 

33 

90 

N 

S 

T 

T 

I 

Q 

F 

D 

A 

C

9 

2266 

34 

97 

D 

A 

C 

L

L 

Q 

I 

F 

A 

I 

9 

2267 

35 

111 

G 

M 

E 

S 

T 

V 

L 

L

A 

M

I

L 

A 

N 


Y 

L 

20 

2453 

42 

77 

T 



M 

p 

\r 
iV 

M 

FX 

T 
i-l 

a 

X 

2415 

A 

4 


D 

P 

N 

G 

N 

E 

q 

q 


T 

1 Q 

2454 

*j 

i7 X 

Q 
O 

rp 
X 

rp 
X 

T 
X 


T7 

X* 

Pi 

a 

V-» 

T 
Xj 

2416 

c 

ion 
IJ u 

H 

P 

L 

R 

H 

A 

1 

V 

T 
i-l 

rp 

1 Q 

2455 

A A 


u 

T3 

rv 

V" 
X 

\T 
V 

A 
t\ 

T 
X 


TT 

rl 

TD 
XT 

T 
Xj 

2417 

b 

140 

L 

P 

R 

V 

T 

K 

X 

VJ 

V 

7\ 
/\ 

1 Q 

2456 


1 A9 
X f±Z 

TD 
XX 

V 

rp 
X 

v 

T 
X 


V 

a 

a 

\7 
V 

2418 

1 

lO JL 

L 

P 

V 

F 

I 

K 


T 
±j 

P 

■C> 

r 

1 Q 

2457 

4± D 

ion 
X o u 

v 

X 


T 
i-l 

IT 

rl 


D 

T J 

V 

JM 


T 

Xj 

2419 

o 
o 

J I 

F 

P 

L 

C 

S 

L 

v 

T 
J_i 

T 
X 

a 

_L O 

2458 

A*? 

X zJ u 



i-i 

U 

T 
X 

TD 

V 

IN 

T 7 
V 

V 

2420 

9 

27 1 

S 

P 

L 

P 

V 

I 

T 

J_l 

7\ 
J\ 

TCI 

T 
X 

X o 

2459 

AQ 

0 19 
Z1Z 

iJ 

o 
o 

T 

Xj 

T 

Xj 

X 


rp 

O 

V" 
X 

Xj 

2421 

10 

80 

M 

P 

K 

M 

L

A 

T 
X 

Tp 

T»7 

Jr 

X D 

2460 

AQ 

9 ^ A 
A J 41 

7\ 

r\ 

7\ 

rv 


TP 

r 

r> 
yj 

rp 
X 

r> 
v_ 

^7 
V 

2422 

11 

108 

S 

L 

S 

G 

M 

E 

C 

rp 

V 

T 

Xj 

1 

X D 

2461 

_> U 

9 A9 
Z 41 Z 


T7 

V 


rl 

V 

V- 

a 

\7 
V 

rp 

T 
X 

2423 

12 

131 

P 

L

R 

H 

A 

T 

T J 

V 

T 

J. 

T 
J_l 

X D 

2462 

D X 

9 Aft 
Z 4t <3 

a 

V 

TT» 

r* 

X 

Jf 

X 

V 

TD 

ir 

TP 

r 

X 

2424 

13 

264 

R 

F 

S 

K 

R 

R 

P, 

c 

p 

ir 

T 
J_l 

1 R 

2463 


9 RH 
Z D U 

r? 
r 

X 


X 

V 

TD 

tr 

rp 

r 

X 

/-^ 

T 

Xj 

2425 

14 

33 

L

C

S 

L 

Y 

L 

T 

X 

a 

V 

T 
i-l 

1 A 
X 4t 

2464 

c; *3 
JJ 

9 R A 
Z D4t 

V 

f 

XT' 

r 

X 

Vj 

Li 

Q 

JM 

■t 7 

V 

TT 

ri 

2426 

15 

109 

L 

S 

G 

M 

E 

S 

m 
1 

V 

T 
Xj 

T 
i-l 

1 A 
X41 

2465 

Df± 

9 ft ft 
ZOO 

o 
o 

In. 

TD 

rs. 

TD 

rs. 

XJ 

c 

TD 
tr 

T 

Xj 

TD 
ir 

\T 
V 

2427 

16 

152 

V 

R 

G 

A 

A 

L 

jj[ 

L v l 


P 

T 

1 A 
X 4 

2466 

cr c 
D D 

9 ft*7 
Z O / 

IS. 

K 

TD 

r\ 

XJ 

o 

TD 
tr 

T 

Xj 

TD 
tr 

\ 7 
V 

T 
X 

2428 

17 

205 

I 

I 

S 

A 

I 

G 

T 

i-l 

p\ 

o 

T 
i_i 

X 4 

2467 

->Q 

9 ft Q 

z o y 

r? 
K 

XI 

o 

TD 

tr 

T 

Xj 

TD 

t r 
V 

X 

T 
Xj 

a 

2429 

18 

215 

L 

I 

S

F 

S 

Y 

T 

i-J 

T 

i-i 

X 
X 

T 

X 4 

2468 


Z / O 


XT 
JM 

T 

X 

X 

T 

Xj 

T 

Xj 

V 

TD 

TD 

T.7 

V 

2430 

19 

268 

R 

R 

D 

S 

P 

L 

p 

IT 

V 

T 
X 

T 
1_1 

X f± 

2469 

C. Q 
D O 

9 ft A 
Z O 41 

V 

o 

XT 

ir 

V 

T 

Xi 

T\7 

TD 
ir 

T 
X 

V 


2431 

20 

29 

L 

A 

F 

P 

L 

C 

Q 

T 
J-J 

X 

T 

J-l 

X J 

2470 

c; q 
d y 

TOO 

z o y 

M 

f 

X 

V 

~V7 

X 

r> 

\j 

V 

JS. 

rp 
X 

IS. 

2432 

21 

148 

V 

A 

A 

V 

V 

R 

\j 

i-i 

■A 

T 

-Lj 

X J 

2471 

D u 

9 Q ft 
z y o 


rp 
X 

!/■ 

r\ 

TT 1 

T 

X 

TD 


TD 

X 

T 

Xj 

2433 

22 

156 

A 

L 

M 

A 

P 

L 

P 

V 

F 

I 

13 














2434 

23 

193 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

L 

13 














2435 

24 

221 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L

13 














2436 

25 

298 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

13














2437 

26 

7 

N 

E 

S 

S 

A 

T 

Y 

F 

I 

L 

12 














2438 

27 

19 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

W 

12 















score 
12 
12 
12 
12 
12 
12 
12 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 


Table XXVIII:

HLA Class II Epitopes (sample 15-mer length) 


SEQ 
ID 

NO: HLA-DRB1*0101 15-mers 




Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

1 

2 

3 

4 

5 

score 

2472

1 

200 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

D 

S 

L 

36 

2473

2 

68 

L 

S 

G 

I 

D 

I 

L 

I 

S 

T 

S 

S 

M 

P 

K 

34 

2474

3 

62 

Y 

I 

F 

L 

C 

M 

L

S 

G 

I 

D 

I 

L 

I 

S 

33 

2475

4 

103 

I 

F 

A 

I 

H 

S 

L 

S

G 

M 

E 

S

T 

V 

L 

32 

2476

5 

45 

L 

T 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S

L 

H 

E 

31 

2477

6 

193 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

S 

31 

2478

7 

277 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

31 

2479

8 

97 

D 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

H 

S

L 

S 

G 

30

2480

9

106 

I 

H 

S 

L 

S 

G 

M 

E 

S 

T 

V 

L

L 

A 

M 

30 

2481

10 

240 

G 

T 

C 

V 

S 

H 

V 

C 

A 

V 

F 

I 

F 

Y 

V 

30 

2482

11 

10 

S 

A 

T 

Y 

F 

I 

L 

I 

G 

L

P 

G 

L 

E 

E 

29

2483

12

289 

N 

P 

I 

V 

Y 

G 

V 

K 

T 

K 

E 

I 

R 

Q 

R 

29 

2484

13 

11 

A 

T 

Y 

F 

I 

L 

I 

G 

L 

P

G 

L 

E 

E 

A 

28

2485

14 

250 

F 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

S 

M 

V 

H 

R 

27 

2486 

15 

140 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

26 

2487

16 

183 

H 

Q 

D 

V 

M 

K 

L 

A 

C 

D 

D 

I 

R 

V 

N 

26 

2488

17 

217 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L

T 

26 

2489 

18 

16 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

W 

L 

A 

25 

2490

19 

24 

E 

A 

Q 

F 

W 

L 

A 

F 

P 

L 

C

S 

L 

Y 

L 

25 

2491 

20 

36 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

25 

2492

21 

70 

G 

I 

D 

I 

L 

I 

S 

T 

S 

S 

M 

P 

K 

M 

L 

25 

2493 

22 

111 

G 

M 

E 

S 

T 

V 

L 

L 

A 

M 

A 

F 

D 

R 

Y 

25 

2494 

23 

148 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

P 

L 

P 

25 

2495 

24 

162 

P 

V 

F 

I 

K 

Q 

L 

P 

F 

C 

R 

S 

N 

I 

L 

25 

2496

25 

197 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

25 

2497 

26 

211 

L 

D 

S 

L 

L 

I 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

25 

2498

27 

218 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

25 

2499

28 

13 

Y 

F 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

24 

2500

29 

30 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

24 

2501

30 

39 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

V 

R 

T 

24 

2502

31 

77 

T 

S 

S

M 

P 

K 

M 

L 

A 

I 

F 

W 

F 

N 

S 

24

2503

32

85 

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

Q 

F 

D 

A 

C 

24 

2504

33 

137 

V 

L 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

24 

2505

34 

151 

V 

V 

R 

G 

A 

A 

L 

M 

A 

P 

L 

P 

V 

F 

I 

24 

2506

35 

161 

L 

P 

V 

F 

I 

K 

Q 

L 

P 

F 

C 

R 

S 

N 

I 

24 

2507

36 

196 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

G 

24 

2508

37 

202 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

24 

2509

38 

208 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

F 

S 

Y 

L 

L 

24 

2510 

39

248

A

V

F

I

F

Y

V

P

F

I

G

L

S

M

V

24

2511 

40 

251 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

S

M 

V 

H 

R 

F 

24 

2512 

41 

83 

M 

L

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

Q 

F 

D 

23 

2513 

42 

101 

L 

Q 

I 

F 

A 

I 

H 

S 

L 

S 

G 

M 

E 

S 

T 

23 

2514 

43 

165 

I 

K 

Q 

L 

P 

F 

C 

R 

S 

N 

I 

L 

S 

H 

S 

23 

2515 

44 

203 

I 

V 

I

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

23 

2516 

45 

221 

L 

L

I

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

23 

2517 

46 

278 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

V 

23 

2518 

47 

27 

F 

W 

L 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

A 

V 

22 

2519 

48 

35 

S 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

22 



















SEQ 
ID 

NO: HLA-DRB1*0101 15-mers 




Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

1 

2 

3 

4 

5 

score 

2520 

49 

61 

M 

Y 

I 

F 

L 

C 

M 

L 

S 

G 

I 

D 

I 

L 

I 

22 

2521 

50 

65 

L 

C 

M 

L 

S 

G 

I 

D 

I 

L 

I 

S 

T 

S 

S 

22 

2522 

51 

80 

M 

P 

K 

M 

L 

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

22 

2523 

52 

145 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

22 

2524 

53 

146 

I 

G 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

P 

22 

2525 

54 

154 

G 

A 

A 

L 

M 

A 

P 

L 

P 

V 

F 

I 

K 

Q 

L 

22 

2526 

55 

205 

I 

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

F 

S 

22 

2527 

56 

243 

V 

S 

H 

V 

C 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

22 

2528 

57 

270 

D 

S 

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

22 

2529 

58 

274 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L

22 

2530 

59 

281 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

V 

Y 

G 

V 

22 

2531 

60 

34 

C 

S 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

21 

2532 

61 

69 

S 

G 

I 

D 

I 

L 

I 

S 

T 

S 

S 

M 

P 

K 

M 

21 

2533 

62 

152 

V 

R 

G 

A 

A 

L 

M 

A 

P 

L 

P 

V 

F 

I 

K 

21 

2534 

63 

299 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

T 

H 

21 

2535 

64 

100 

L 

L 

Q 

I 

F 

A 

I 

H 

S 

L 

S 

G 

M 

E 

S 

20 

2536 

65 

135 

A 

T 

V 

L 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

20 

2537 

66 

141 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

A 

20 

2538 

67 

191 

C 

D 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

20 

2539 

68 

199 

V 

Y 

G 

L 

I 

V 

I 

I 

S

A 

I 

G 

L 

D 

S 

20 

2540 

69 

262 

V 

H 

R 

F 

S 

K 

R 

R 

D 

S 

P 

L 

P 

V 

I 

20 

2541 

70 

271 

S

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

20 

2542 

71 

28 

W

L 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

A 

V 

L 

19 

2543 

72 

58 

H 

E 

P 

M 

Y 

I 

F 

L 

C 

M 

L 

S 

G 

I 

D 

19 

2544 

73 

59 

E 

P 

M 

Y 

I 

F 

L 

C 

M 

L 

S 

G 

I 

D 

I 

19 

2545 

74 

60 

P 

M 

Y 

I 

F 

L 

C 

M 

L 

S 

G 

I 

D 

I 

L 

19 

2546 

75 

98 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

H 

S 

L 

S 

G 

M 

19 

2547 

76 

215 

L 

I 

S 

F 

S

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

19 

2548 

77 

219 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

19 

2549 

78 

228 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

F 

G 

T 

C 

19 

2550 

79 

232 

R 

E 

A 

Q 

A 

K 

A 

F 

G 

T 

C 

V 

S 

H 

V 

19 

2551 

80 

246 

V 

C 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

S 

19 

2552 

81 

297 

T 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

19 

2553 

82 

3 

D 

P 

N 

G 

N 

E 

S 

S 

A 

T 

Y 

F 

I 

L 

I 

18 

2554 

83 

14 

F 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

W 

18 

2555 

84 

25 

A 

Q 

F 

W 

L 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

18 

2556 

85 

42 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S

18 

2557 

86 

46 

T 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

H 

E 

P

18 

2558 

87 

78 

S 

S 

M 

P 

K 

M 

L 

A 

I 

F 

W 

F 

N 

S 

T 

18 

2559 

88 

84 

L 

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

Q 

F 

D 

A 

18 

2560 

89 

89 

F 

N 

S 

T 

T 

I 

Q 

F 

D 

A 

C

L 

L 

Q 

I 

18 

2561 

90 

93 

T 

I 

Q 

F 

D 

A 

C

L 

L 

Q 

I 

F 

A 

I 

H 

18 

2562 

91 

115 

T 

V 

L 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

C 

18 

2563 

92 

119 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

C 

H 

P 

L 

R 

18 

2564 

93 

127 

A 

I 

C 

H 

P 

L 

R 

H 

A 

T 

V 

L 

T 

L 

P 

18 

2565 

94 

129 

C 

H 

P 

L 

R 

H 

A 

T 

V 

L 

T 

L 

P 

R 

V 

18 

2566 

95 

147 

G 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

P 

L 

18 

2567 

96 

149 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

P 

L 

P 

V 

18 

2568 

97 

216 

I 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

18 

2569 

98 

227 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

F 

G 

T 

18

SEQ
ID 

NO: HLA-DRB1*0101 15-mers 


Pos 




1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

1 

2 

3 

4 

5 

2570

99

249 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

S 

M 

V 

H 

2571

100

253 

Y 

V 

P 

F 

I 

G 

L 

S 

M 

V 

H 

R 

F 

S 

K 

2572

101

284 

V 

P 

P 

V 

L 

N 

P 

I 

V 

Y 

G 

V 

K 

T 

K 

2573

102

286 

P 

V 

L 

N 

P 

I 

V 

Y 

G 

V 

K 

T 

K 

E 

I 

2574

103

303 

R 

I 

L 

R 

L 

F 

H 

V 

A 

T 

H 

A 

S 

E 

P 


score 

18 
18 
18 
18 
18 


SEQ 

ID 

NO: 


HLA-DRB1*0301 (DR17) 15-mers 
Pos 





1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

1 

2 

3 

4 

5 

2575 

1 

16 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

W 

L 

A 

2576 

2 

206 

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

F 

S 

Y 

2577 

3 

91 

S 

T 

T 

I 

Q 

F 

D 

A 

C 

L 

L 

Q 

I 

F 

A 

2578 

4 

117 

L 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

C 

H 

P 

2579 

5 

38 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

V 

R 

2580 

6 

179 

S 

Y 

C 

L 

H 

Q 

D 

V 

M 

K 

L 

A 

C

D 

D 

2581 

7 

211 

L 

D 

S

L 

L 

I 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

2582 

8 

219 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

2583 

9 

272 

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

2584 

10 

26 

Q 

F 

W 

L 

A 

F 

P 

L 

C 

S 

L 

Y 

L 

I 

A 

2585 

11 

114 

S 

T 

V 

L 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

I 

2586 

12 

129 

C

H 

P

L 

R 

H 

A 

T 

V 

L 

T 

L 

P 

R 

V 

2587 

13 

134 

H 

A 

T 

V 

L 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

2588 

14 

186 

V 

M 

K 

L 

A 

C 

D 

D 

I 

R 

V 

N 

V 

V 

Y 

2589 

15 

200 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

D 

S

L 

2590 

16 

270 

D 

S 

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

2591 

17 

297 

T 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

2592 

18 

11 

A 

T 

Y 

F 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

2593 

19 

54 

E 

H 

S 

L 

H 

E 

P 

M 

Y 

I 

F 

L 

C 

M 

L 

2594 

20 

106 

I 

H 

S 

L 

S 

G 

M 

E 

S 

T 

V 

L 

L 

A 

M 

2595 

21 

165 

I 

K 

Q 

L 

P 

F 

C 

R 

S 

N 

I 

L 

S 

H 

S 

2596 

22 

191 

C 

D 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

2597 

23 

203 

I 

V 

I 

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

S 

2598 

24 

213 

S

L 

L 

I 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

2599 

25 

224 

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

2600 

26 

227 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

F 

G 

T 

2601 

27 

248 

A 

V 

F 

I 

F 

Y 

V 

P 

F 

I 

G 

L 

S 

M 

V 

2602 

28 

254 

V 

P 

F 

I 

G 

L 

S 

M 

V 

H 

R 

F 

S

K 

R 

2603 

29 

277 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

2604 

30 

36 

L

Y 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

I 

2605 

31 

93 

T 

I 

Q 

F 

D 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

H 

2606 

32 

98 

A 

C 

L 

L 

Q 

I 

F 

A 

I 

H 

S 

L 

S 

G 

M 

2607 

33 

125 

Y 

V 

A 

I 

C

H 

P 

L 

R 

H 

A 

T 

V 

L 

T 

2608 

34 

158 

M 

A 

P 

L 

P

V 

F 

I 

K 

Q 

L 

P 

F 

C 

R 

2609 

35 

187 

M 

K 

L 

A 

C

D 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

2610 

36 

217 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

2611 

37 

225 

K 

T 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

A 

K 

A 

F 

2612 

38 

281 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I

V 

Y 

G 

V 

2613 

39 

288 

L 

N 

P 

I 

V 

Y 

G 

V 

K 

T 

K 

E 

I 

R 

Q 

2614 

40 

18 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

W 

L 

A 

F 

P


score 


26 
23 
22 
22 
21 
21 
21 
21 
21 
20 
20 
20 
20 
20 
20 
20 
20 
19 
19 
19 
19 
19 
19 
19 
19 
19 
19 
19 
19 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
17


SEQ 
ID 

NO: HLA-DRB1*0301 (DR17) 15-mers 




Pos 

1 

2 

3 

4 

5 

6 

7 

8 

9 

0 

1 

2 

3 

4 

5 

score

2615 

41 

44 

N 

L 

T 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

H 

17 

2616 

42 

145 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

A 

A 

L 

M 

A 

17 

2617 

43 

159 

A 

P 

L 

P 

V 

F 

I 

K 

Q 

L 

P 

F 

C 

R 

S 

17 

2618 

44 

256 

F 

I 

G 

L 

S 

M 

V 

H 

R 

F 

S 

K 

R 

R 

D 

17 

2619 

45 

259 

L 

S 

M 

V 

H 

R 

F 

S 

K 

R 

R 

D 

S 

P 

L 

17 

2620 

46 

137 

V 

L 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

16 

2621 

47 

262 

V 

H 

R 

F 

S 

K 

R 

R 

D 

S 

P 

L 

P 

V 

I 

16 

2622 

48 

294 

G 

V 

K 

T 

K 

E 

I 

R 

Q 

R 

I 

L 

R 

L 

F 

16 

2623 

49 

46 

T 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

H 

E 

P 

15 

2624 

50 

51 

V 

R 

T 

E 

H 

S 

L 

H 

E 

P 

M 

Y 

I 

F 

L 

15 

2625 

51 

172 

R 

S 

N 

I 

L 

S 

H 

S 

Y 

C 

L 

H 

Q 

D 

V 

15 

2626 

52 

189 

L 

A 

C 

D 

D 

I 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

15 

2627 

53 

212 

D 

S 

L 

L 

I 

S 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

15 

2628 

54 

218 

F 

S 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

15 

2629 

55 

271 

S 

P 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

15 

2630 

56 

279 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

V 

Y 

15 

2631 

57 

12 

T 

Y 

F 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

14 

2632 

58 

35 

S 

L 

Y 

L 

I 

A 

V 

L 

G 

N 

L 

T 

I 

I 

Y 

14 

2633 

59 

64 

F 

L 

C 

M 

L 

S 

G 

I 

D 

I 

L 

I 

S 

T 

S 

14 

2634 

60 

140 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

A 

V 

V 

R 

G 

14 

2635 

61 

273 

L 

P 

V 

I 

L 

A 

N 

I 

Y 

L 

L 

V 

P 

P 

V 

14 

2636 

62 

301 

R 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

T 

H 

A 

S 

14 

2637 

63 

13 

Y 

F 

I 

L 

I 

G 

L 

P 

G 

L 

E 

E 

A 

Q 

F 

13 

2638 

64 

47 

I 

I 

Y 

I 

V 

R 

T 

E 

H 

S 

L 

H 

E 

P 

M 

13 

2639 

65 

71 

I 

D 

I 

L 

I 

S 

T 

S 

S 

M 

P 

K 

M 

L 

A 

13 

2640 

66 

80 

M 

P 

K 

M 

L 

A 

I 

F 

W 

F 

N 

S 

T 

T 

I 

13 

2641 

67 

109 

L 

S 

G 

M 

E 

S 

T 

V 

L 

L 

A 

M 

A 

F 

D 

13 

2642 

68 

113 

E 

S 

T 

V 

L 

L 

A 

M 

A 

F 

D 

R 

Y 

V 

A 

13 

2643 

69 

135 

A 

T 

V 

L 

T 

L 

P 

R 

V 

T 

K 

I 

G 

V 

A 

13 

2644 

70 

195 

R 

V 

N 

V 

V 

Y 

G 

L 

I 

V 

I 

I 

S 

A 

I 

13 

2645 

71 

202 

L 

I 

V 

I 

I 

S 

A 

I 

G 

L 

D 

S 

L 

L 

I 

13 

2646 

72 

220 

Y 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

A 

13 

2647 

73 

221 

L 

L 

I 

L 

K 

T 

V 

L 

G 

L 

T 

R 

E 

A 

Q 

13 

2648 

74 

264 

R 

F 

S 

K 

R 

R 

D 

S 

P 

L 

P 

V 

I 

L 

A 

13 

2649 

75 

280 

I 

Y 

L 

L 

V 

P 

P 

V 

L 

N 

P 

I 

V 

Y 

G 

13 

2650 

76 

302 

Q 

R 

I 

L 

R 

L 

F 

H 

V 

A 

T 

H 

A 

S 

E 

13 


HLA-DRB1*0401 (DR4Dw4) 15-mers

SEQ ID NO:       Pos  1 2 3 4 5 6 7 8 9 0 1 2 3 4 5  score
2651          1   36  L Y L I A V L G N L T I I Y I     26
2652          2   45  L T I I Y I V R T E H S L H E     26
2653          3   68  L S G I D I L I S T S S M P K     26
2654          4   83  M L A I F W F N S T T I Q F D     26
2655          5  134  H A T V L T L P R V T K I G V     26
2656          6  145  K I G V A A V V R G A A L M A     26
2657          7  224  L K T V L G L T R E A Q A K A     26
2658          8  227  V L G L T R E A Q A K A F G T     26
2659          9  256  F I G L S M V H R F S K R R D     26
2660         10  281  Y L L V P P V L N P I V Y G V     26
2661         11  289  N P I V Y G V K T K E I R Q R     26
2662         12  301  R Q R I L R L F H V A T H A S     26
2663         13   11  A T Y F I L I G L P G L E E A     22
2664         14   24  E A Q F W L A F P L C S L Y L     22
2665         15   25  A Q F W L A F P L C S L Y L I     22
2666         16   34  C S L Y L I A V L G N L T I I     22
2667         17   84  L A I F W F N S T T I Q F D A     22
2668         18  122  F D R Y V A I C H P L R H A T     22
2669         19  197  N V V Y G L I V I I S A I G L     22
2670         20  215  L I S F S Y L L I L K T V L G     22
2671         21  217  S F S Y L L I L K T V L G L T     22
2672         22  250  F I F Y V P F I G L S M V H R     22
2673         23  278  A N I Y L L V P P V L N P I V     22
2674         24   19  L P G L E E A Q F W L A F P L     20
2675         25   30  A F P L C S L Y L I A V L G N     20
2676         26   33  L C S L Y L I A V L G N L T I     20
2677         27   35  S L Y L I A V L G N L T I I Y     20
2678         28   39  I A V L G N L T I I Y I V R T     20
2679         29   42  L G N L T I I Y I V R T E H S     20
2680         30   44  N L T I I Y I V R T E H S L H     20
2681         31   48  I Y I V R T E H S L H E P M Y     20
2682         32   58  H E P M Y I F L C M L S G I D     20
2683         33   62  Y I F L C M L S G I D I L I S     20
2684         34   65  L C M L S G I D I L I S T S S     20
2685         35   71  I D I L I S T S S M P K M L A     20
2686         36   80  M P K M L A I F W F N S T T I     20
2687         37   81  P K M L A I F W F N S T T I Q     20
2688         38   91  S T T I Q F D A C L L Q I F A     20
2689         39   97  D A C L L Q I F A I H S L S G     20
2690         40   98  A C L L Q I F A I H S L S G M     20
2691         41  100  L L Q I F A I H S L S G M E S     20
2692         42  103  I F A I H S L S G M E S T V L     20
2693         43  106  I H S L S G M E S T V L L A M     20
2694         44  115  T V L L A M A F D R Y V A I C     20
2695         45  117  L L A M A F D R Y V A I C H P     20
2696         46  125  Y V A I C H P L R H A T V L T     20
2697         47  129  C H P L R H A T V L T L P R V     20
2698         48  137  V L T L P R V T K I G V A A V     20
2699         49  140  L P R V T K I G V A A V V R G     20
2700         50  155  A A L M A P L P V F I K Q L P     20
2701         51  162  P V F I K Q L P F C R S N I L     20
2702         52  165  I K Q L P F C R S N I L S H S     20
2703         53  179  S Y C L H Q D V M K L A C D D     20
2704         54  183  H Q D V M K L A C D D I R V N     20
2705         55  186  V M K L A C D D I R V N V V Y     20
2706         56  193  D I R V N V V Y G L I V I I S     20
2707         57  196  V N V V Y G L I V I I S A I G     20
2708         58  199  V Y G L I V I I S A I G L D S     20
2709         59  200  Y G L I V I I S A I G L D S L     20
2710         60  202  L I V I I S A I G L D S L L I     20
2711         61  203  I V I I S A I G L D S L L I S     20
2712         62  206  I S A I G L D S L L I S F S Y     20
2713         63  208  A I G L D S L L I S F S Y L L     20
2714         64  211  L D S L L I S F S Y L L I L K     20
2715         65  212  D S L L I S F S Y L L I L K T     20
2716         66  218  F S Y L L I L K T V L G L T R     20
2717         67  240  G T C V S H V C A V F I F Y V     20
2718         68  243  V S H V C A V F I F Y V P F I     20
2719         69  246  V C A V F I F Y V P F I G L S     20
2720         70  248  A V F I F Y V P F I G L S M V     20
2721         71  251  I F Y V P F I G L S M V H R F     20
2722         72  272  P L P V I L A N I Y L L V P P     20
2723         73  277  L A N I Y L L V P P V L N P I     20
2724         74  285  P P V L N P I V Y G V K T K E     20
2725         75   18  G L P G L E E A Q F W L A F P     18
2726         76   27  F W L A F P L C S L Y L I A V     18
2727         77   69  S G I D I L I S T S S M P K M     18
2728         78   94  I Q F D A C L L Q I F A I H S     18
2729         79   99  C L L Q I F A I H S L S G M E     18
2730         80  107  H S L S G M E S T V L L A M A     18
2731         81  116  V L L A M A F D R Y V A I C H     18
2732         82  126  V A I C H P L R H A T V L T L     18
2733         83  164  F I K Q L P F C R S N I L S H     18
2734         84  176  L S H S Y C L H Q D V M K L A     18
2735         85  187  M K L A C D D I R V N V V Y G     18
2736         86  205  I I S A I G L D S L L I S F S     18
2737         87  233  E A Q A K A F G T C V S H V C     18
2738         88  237  K A F G T C V S H V C A V F I     18
2739         89  271  S P L P V I L A N I Y L L V P     18
2740         90  293  Y G V K T K E I R Q R I L R L     18
2741         91  294  G V K T K E I R Q R I L R L F     18
2742         92   10  S A T Y F I L I G L P G L E E     16
2743         93   28  W L A F P L C S L Y L I A V L     16
2744         94   59  E P M Y I F L C M L S G I D I     16
2745         95   61  M Y I F L C M L S G I D I L I     16
2746         96   85  A I F W F N S T T I Q F D A C     16
2747         97  101  L Q I F A I H S L S G M E S T     16
2748         98  177  S H S Y C L H Q D V M K L A C     16
2749         99  236  A K A F G T C V S H V C A V F     16
2750        100  249  V F I F Y V P F I G L S M V H     16
2751        101  253  Y V P F I G L S M V H R F S K     16
2752        102   13  Y F I L I G L P G L E E A Q F     14
2753        103   14  F I L I G L P G L E E A Q F W     14
2754        104   16  L I G L P G L E E A Q F W L A     14
2755        105   38  L I A V L G N L T I I Y I V R     14
2756        106   47  I I Y I V R T E H S L H E P M     14
2757        107   54  E H S L H E P M Y I F L C M L     14
2758        108   60  P M Y I F L C M L S G I D I L     14
2759        109   64  F L C M L S G I D I L I S T S     14
2760        110   70  G I D I L I S T S S M P K M L     14
2761        111   72  D I L I S T S S M P K M L A I     14
2762        112  109  L S G M E S T V L L A M A F D     14
2763        113  113  E S T V L L A M A F D R Y V A     14
2764        114  135  A T V L T L P R V T K I G V A     14
2765        115  143  V T K I G V A A V V R G A A L     14
2766        116  148  V A A V V R G A A L M A P L P     14
2767        117  149  A A V V R G A A L M A P L P V     14
2768        118  154  G A A L M A P L P V F I K Q L     14
2769        119  158  M A P L P V F I K Q L P F C R     14
2770        120  173  S N I L S H S Y C L H Q D V M     14
2771        121  184  Q D V M K L A C D D I R V N V     14
2772        122  191  C D D I R V N V V Y G L I V I     14
2773        123  195  R V N V V Y G L I V I I S A I     14
2774        124  213  S L L I S F S Y L L I L K T V     14
2775        125  220  Y L L I L K T V L G L T R E A     14
2776        126  221  L L I L K T V L G L T R E A Q     14
2777        127  225  K T V L G L T R E A Q A K A F     14
2778        128  259  L S M V H R F S K R R D S P L     14
2779        129  270  D S P L P V I L A N I Y L L V     14
2780        130  273  L P V I L A N I Y L L V P P V     14
2781        131  274  P V I L A N I Y L L V P P V L     14
2782        132  280  I Y L L V P P V L N P I V Y G     14
2783        133  284  V P P V L N P I V Y G V K T K     14
2784        134  302  Q R I L R L F H V A T H A S E     14
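
The 15-mer tables list overlapping windows of the 101P3A11 protein, each identified by the position (Pos) of its first residue. As a minimal sketch of how such windows can be enumerated, the Python below slides a fixed-length window along a sequence. The function name `iter_kmers` is hypothetical, the 18-residue fragment is pieced together from the Pos 10 to Pos 13 rows of the tables, and no binding score is computed here.

```python
from typing import Iterator, Tuple


def iter_kmers(protein: str, k: int = 15, first_pos: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (position, k-mer) for every overlapping window of length k.

    first_pos is the 1-based position of protein[0] within the full-length protein.
    """
    for offset in range(len(protein) - k + 1):
        yield first_pos + offset, protein[offset:offset + k]


# Residues 10-27, assembled from the Pos 10, 11, 12 and 13 rows (illustrative only).
fragment = "SATYFILIGLPGLEEAQF"

for pos, peptide in iter_kmers(fragment, k=15, first_pos=10):
    # Print in the same layout as the tables: position, then spaced residues.
    print(f"{pos:>3}  {' '.join(peptide)}")
```

Each printed row should correspond to one table entry with the same Pos value; the score column would come from a separate HLA binding-motif scoring step that is not reproduced in this sketch.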


HLA-DRB1*1101 15-mers

SEQ ID NO:       Pos  1 2 3 4 5 6 7 8 9 0 1 2 3 4 5  score
2785          1  145  K I G V A A V V R G A A L M A     28
2786          2  122  F D R Y V A I C H P L R H A T     25
2787          3  217  S F S Y L L I L K T V L G L T     25
2788          4  197  N V V Y G L I V I I S A I G L     24
2789          5   10  S A T Y F I L I G L P G L E E     23
2790          6  255  P F I G L S M V H R F S K R R     23
2791          7   44  N L T I I Y I V R T E H S L H     22
2792          8   59  E P M Y I F L C M L S G I D I     22
2793          9  158  M A P L P V F I K Q L P F C R     22
2794         10  237  K A F G T C V S H V C A V F I     22
2795         11   74  L I S T S S M P K M L A I F W     21
2796         12  134  H A T V L T L P R V T K I G V     20
2797         13  137  V L T L P R V T K I G V A A V     20
2798         14  162  P V F I K Q L P F C R S N I L     20
2799         15  199  V Y G L I V I I S A I G L D S     20
2800         16  224  L K T V L G L T R E A Q A K A     20
2801         17  256  F I G L S M V H R F S K R R D     20
2802         18  290  P I V Y G V K T K E I R Q R I     20
2803         19  301  R Q R I L R L F H V A T H A S     20
2804         20   65  L C M L S G I D I L I S T S S     19


[Rows for SEQ ID NOs: 2805 onward of the HLA-DRB1*1101 15-mers table are illegible in the source.]

Table XXIX. Nucleotide sequence in the 5' region close to the 101P3A11 gene (SEQ ID NO: 2846).

   1 TGCGCTCCAC CAAGCCTGGC TAACTTTTGC ATTTTTAATA GAGGCAGGGT TTCACCATGT
  61 TGGCCTGGCT GGTCTCGAAC CCCTGACCTT GCGATCTGCC CACCTCGGCC TCCCAAAGTG
 121 CTGGGATTAC AGGCGTGAGC CACTGTACCT GGCGGGGCTT ATTGTTTTTT AAAAAGATTT
 181 CCAAAACCTT GCCCTGGCAA TTCTGATTTT CTGGGCCTGG AGCAGGACCT GGAGGGATGG
 241 TGTTGTCAAT TACTTTAGAT GTTTCTATCA GGAAAGTTTG AGAAATGGTA TTCAGGCCTA
 301 AACACAAACC TCTCTTGAAA TCTCATCCCA GACTGAGCCC CTGCTCCCTA TCTTAAATTA
 361 GATTATAGTA GGTCTTAAAG TCAGCTGTAG ACTGAGCCTC TAAATCTGAA CCCAGACCCA
 421 CCCTAACCCC AGGATACATC AGAAGAGCTG GTCAATGTGG ACCATTCTGA GCAATCCTGC
 481 AAGTCTACTC TGATGGGAAA AGGCTAAGAG CAGTGCCCTG GGCAGCAACA TCAGCTCTGA
 541 AGATGCAGGA CTGTGTTACA TGTTTTATGA GTGGGTCTTC ACACACTGAG ATTCATGGGA
 601 CAGTAATAGA ATCTGCTTGT GCAGCACTGG GGCCTTGGAG GGTCAGGGTA AGGCTCAAGA
 661 TGTCCAGGAA GTTGTATATA AGGAGAATCA GAGCAGAGAG AGACTAGGGT TCAGAATTAC
 721 CAGGATGACT TAGTCCTGTT TGTTACTGTC ACCACTCCAA TGCCTTTTCC TCATTAGTCC
 781 TTTCTCTCCT CTGAGCCACA ACTAAATGAT GTTTCTACTT TTCCCTTTCT ACTTTCCTAG
 841 ACCCTGGATT TTGTATGCAG AAGCCCCAGC TCTTGGTCCC TATCATAGCC ACTTCAAATG
 901 GAAATCTGGT CCACGCAGCA TACTTCCTTT TGGTGGGTAT CCCTGGCCTG GGGCCTACCA
 961 TACACTTTTG GCTGGCTTTC CCACTGTGTT TTATGTATGC CTTGGCCACC CTGGGTAACC
1021 TGACCATTGT CCTCATCATT CGTGTGGAGA GGCGACTGCA TGAGCCCATG TACCTCTTCC
1081 TGGCCATGCT TTCCACTATT GACCTAGTCC TCTCCTCTAT CACCATGCCC AAGATGGCCA
1141 GTCTTTTCCT GATGGGCATC CAGGAGATCG AGTTCAACAT TTGCCTGGCC CAGATGTTCC
1201 TTATCCATGC TCTGTCAGCC GTGGAGTCAG CTGTCCTGCT GGCCATGGCT TTTGACCGCT
1261 TTGTGGCCAT TTGCCACCCA TTGCGCCATG CTTCTGTGCT GACAGGGTGT ACTGTGGCCA
1321 AGATTGGACT ATCTGCCCTG ACCAGGGGGT TTGTATTCTT CTTCCCACTG CCCTTCATCC
1381 TCAAGTGGTT GTCCTACTGC CAAACACATA CTGTCACACA CTCCTTCTGT CTGCACCAAG
1441 ATATTATGAA GCTGTCCTGT ACTGACACCA GGGTCAATGT GGTTTATGGA CTCTTCATCA
1501 TCCTCTCAGT CATGGGTGTG GACTCTCTCT TCATTGGCTT CTCATATATC CTCATCCTGT
1561 GGGCTGTTTT GGAGCTGTCC TCTCGGAGGG CAGCACTCAA GGCTTTCAAC ACCTGCATCT
1621 CCCACCTCTG TGCTGTTCTG GTCTTCTATG TACCCCTCAT TGGGCTCTCG GTGGTGCATA
1681 GGCTGGGTGG TCCCACCTCC CTCCTCCATG TGGTTATGGC TAATACCTAC TTGCTGCTAC
1741 CACCTGTAGT CAACCCCCTT GTCTATGGAG CCAAGACCAA AGAGATCTGT TCAAGGGTCC
1801 TCTGTATGTT CTCACAAGGT GGCAAGTGAG ACACCTTAGT GTCTCGCTTC TACTACTACT
1861 ACAGAAGATG GGAATATTAG GATCCTATTG AATGCCTTGG TGATTAAAGT ATCAAACCTA
1921 TTGTGCTGTC TTCTTCCAGC AATTTAAGTA GATCATGTAT TCTGTCTCCA GGAATGTGTC
1981 AGTACTGAAC TTATGACCCT GTCTGGACAT CCTGGAGAAT GACTGCACTA GTCCCTCTGC
2041 TATGGTGGTC TTGCCTTCTC CTTCTCTCTC AGCTAGAAAA TACATCTAGT TTTGACATGG
2101 GGAGGCTGTA AAGATCACAC CTCATGGTTC ATTCCAGTTT TGAAGTATGA TTTTAATGTT
2161 CTTGCCCCCA TGTGCCCATG TTGGTGAATT TGCATGGACT ATAAACGTTA TTGCAAATAC
2221 CCTAAAGTGG TTACCCAGCC ATAATCAGGG GTTAATGAAG GTATTTGGGG AATAGTAACT
2281 GGAGAGACAG CAACAAGACA AGAGGCAGCT CACATGCAAT GTTGAAGTTT CTGTATGCAA
2341 GAGGGTGTGT TGGCAGATTT GTGAAATCTG CCCATTTGCA TCTGTATGGC TCTATATGAC
2401 TATTTGTCCA TAAGGGTGCC ATGTATTCTG GTTGTGGGTG TGAATGTGTG GGTGTGTTTA
2461 TGTGGACACT TGCTTTTCAG TGTGCGTATA TGTGAGAGAG AGGGTGCACA CATGGAATAC
2521 GTACTGGTTG TGTCCTGGTG AGTGTGGTAG CTATGTCCTG GCACATGTAT GTTTCATGAG
2581 ACGTGTCTCT GATTGCGCAT TTGTATTTCT GTGGTATCTG TTAGTTGGTA TATGATATGT
2641 GTCTACGTGA GAATGCTGGT GTCTGTATCT GCATGGTGGG CAGTACCTTT ATGTGTATCT
2701 GGTAAGAATG CTGCCTCTAC CTTTTCTTCC TATTTGTACT ATGTGAATGT GGTGCATGAA
2761 TGTGTGGAAT GTGTGGAATG TGTAGTATTG GGATGCCTGT ATCTTTCAGC GTGTTTGGGT
2821 GTATGTCCAC TGTGCATAAT ATTTGAGATG TAAAACCATT TTGTGCGGTA TATGTGTTAT
2881 TAGTTGTAAG TCGGTGAAAT GTACATCTGA ATTCTGTGTG CATATTGTTG GTACTGATGC
2941 TATTTTCGTG CATATGTCTA GTGTATATGT TTTAAGGCAA ACTTTCTTTG TGTGTTGGGT
3001 GTGTATGTGA CACGAATGGG GACAGCATCT GTATTTCTGA GCATGGATTG ATGTGTGGTG
3061 TCTGTATGTA TCTTGGAATG GAGGAGGGAG ATTGAAGAAG TCTGGCTGTG AGCAGCAGAA
3121 ATAATTTCCA AAGTTGAGTG ACATGACTCT AAGATGCCCA GTTTCTCGGC CTGGGGTCAG
3181 CCTGGGTGAT AGCTCAGTCT GTCAGAATGA AAGGAAACAC GGTGCTTCCT TGCTCCACCT
3241 TTTCACAGGC CAGACCACAC CTTCTTCATC CTGAACACAA GGATTTCAAG GGCTTTTGTT
3301 ACCTCTTCCT ACGTTTCCTG CCTCTGCTAT CCGAGGCACT GGCCTCCCTA AACCCTGCCC
3361 TCCTGCCTCA ATAGCAAGTC ATGGTATCCT CACCTCTCCC TTCCCTTTTT GGCTTATCTG
3421 CCAAACATGT ATAAAAGTCC TTGGTTCCCC ATCTCTACTA AAAATACAAC AATTAGCCGG
3481 GTGTGATGGC GCGTGCCTGT AGTCCCAGCT AGTTGGGAGG CTGAGGCAGG AGAAACGCTT
3541 GAGCCCGCAA GGTGGAGGTT GCAGTGAGCC GAGATCATGC CACTGCACTC CAGCCTGGTG
3601 ACAGAGCAAG ACTCTGTGTC AAAAAAAAAA AAAAAAAAAA AGCCTTGGTT GTAGGGAGTT
3661 TCTCCTAATC CCTCTGGGAA AGCAAGGGTG GAGGGGAAGC CAGTCAATCT CCCTTCTGTT
3721 GCCGCATGGA AACTCCCTTA AGGCAGGAAG CTGAAAAAAC TGTAGCATTC ACCTCATTAT
3781 TCACCTTGTC TCATGTCTCA CTGTCCTTCC ACATGTCTCA TTGTTACTCC ATATTGGATG
3841 GAAGTAGAAG TCCCTTTGGT ATTTTTTAAA GTCTTTGCCA TGTCTAAGTT AATGAGGTTA
3901 ATGGAGGCAG CAGAGATGGC TCCAGGGTTC TGATAGCAAG TGTCAGGCTG CGTGCTCTGT
3961 AGGCACCAGA AACTGTTGTC ACCAGTAATT TTGATGTGGT CTGAGTTAGA ATGGTCTGAT
4021 TTGCCATGAT CTATTTAACA TAGCTTGATT TAGCGTGTCC TGTGTTCTGA ATTTAAAACT
4081 CACAGTTGTG AAACTGATCA GTAAAAAATA AGGGGAGACC AACTAAAAAC CATGTTGTTC
4141 TATTTATAGA TGTAGTTTTT ACTTATTTCA AAATACGAGG TATTTAGTTT TACATTCAAA
4201 TTGTTCTCTA ACTCTCTAAA ATGTTCTCTG ACTATTTTTG CCCTTAAGGG AGAAACCAGA
4261 TGTCATTGGT CTTACGTGGC TGGTGTTGGG GGTGGGGAGG GTTAAAGAAA CCACGTTCTC
4321 TGTCCTCAGC CAGAAGTTCA GTAATCCAAG GCCAGAGAGT GGACGGCAGA GGCACTGTCC
4381 CTGGGGACCT TGGTTATAAG TTATCCAGAC ACAGGGACCA GAGCCTGGGA GACAAAAAAA
4441 GATGTAGCCC TAGGGCTTTG GGAAAAGGAG GATGGACCCA GTGAATTCCA CGCTTAGCAA
4501 GGACCTAAAC AGTGTCCCCC AAATGAGAGA AGGGAGGACA GAAAGAACAC TTCAGGATGG
4561 AAATGGGCTG ACACTTAACC GTGGAGTGTC TCTGCAAACT TCCTTTGCCA TTCTCCTGTT
4621 TGAGTTTGAT AAACCTGAGA AGAGACTTGG ATAAAGACCG TCACGAAGAC TACACTAATG
4681 AGTTTCTTCT AGCTTTTTTC TACTCACTTT CCCTATCTAT CCTTCACATT GGGAGTTGGC
4741 ATGAGGATCC CAGCAGCCCA TCAGGGGAGG ACTCTAGAGA TCCCTTTCCC CATTGCCTCT
4801 CCTCCCCATA CCCCCAGGCA TATCCTCCCA GGGCACGGAA GCTGAGAAGC AGTCCAGAAC
4861 CACAGTGGGC TAGTGAGGGG TACCTGCTGA TGTACCCTTT GGACAGCATT CTGCCCCACC
4921 CTGCAGGAAG AAGCAGAAGG AGGGAGAGGG TGAGGCAGAG AATAAATAAC CCTGACCAGG
4981 GAGGTCCAAG GGAGTAGGCG GAGAcagaga ggctgtattt cagtgcagcc tgccagacct


Note: The three highest-scoring predicted promoters are shown in bold and underlined.
The lower-case sequence indicates the beginning of the transcript of the
101P3A11 gene.
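
A numbered listing of this kind (a running position followed by blocks of ten nucleotides) can be checked mechanically for dropped or duplicated bases. The sketch below is only an illustration: the helper name `parse_numbered_sequence` is hypothetical, and the two input lines are the first two rows of Table XXIX.

```python
from typing import Iterable


def parse_numbered_sequence(lines: Iterable[str]) -> str:
    """Join lines such as '61 TGGCCTGGCT GGTCTCGAAC ...' into one string.

    Raises ValueError when a line's leading number does not equal the position
    expected from the bases read so far, which flags missing or extra bases.
    """
    sequence = ""
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        start, blocks = int(fields[0]), fields[1:]
        expected = len(sequence) + 1
        if start != expected:
            raise ValueError(f"expected position {expected}, found {start}")
        sequence += "".join(blocks)
    return sequence


# First two rows of Table XXIX (SEQ ID NO: 2846).
table_lines = [
    "1 TGCGCTCCAC CAAGCCTGGC TAACTTTTGC ATTTTTAATA GAGGCAGGGT TTCACCATGT",
    "61 TGGCCTGGCT GGTCTCGAAC CCCTGACCTT GCGATCTGCC CACCTCGGCC TCCCAAAGTG",
]
sequence = parse_numbered_sequence(table_lines)
print(len(sequence))
```

Because case is preserved, the lower-case transcript start at the end of the listing would remain distinguishable after parsing.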

Table XXX: Promoters and their positions predicted by Neural Network Promoter 
Prediction computer program. 


Start   End  Score  SEQ ID NO  Promoter Sequence
   25    75   0.91       2847  TTTTGCATTTTTAATAGAGGCAGGGTTTCACCATGTTGGCCTGGCTGGTC
  665   715   0.95       2848  CAGGAAGTTGTATATAAGGAGAATCAGAGCAGAGAGAGACTAGGGTTCAG
 2477  2527   0.91       2849  TCAGTGTGCGTATATGTGAGAGAGAGGGTGCACACATGGAATACGTACTG
 3139  3189   0.82       2850  TGACATGACTCTAAGATGCCCAGTTTCTCGGCCTGGGGTCAGCCTGGGTG
 3420  3470   0.96       2851  GCCAAACATGTATAAAAGTCCTTGGTTCCCCATCTCTACTAAAAATACAA
 4092  4142   0.99       2852  AACTGATCAGTAAAAAATAAGGGGAGACCAACTAAAAACCATGTTGTTCT
 4953  5003   0.97       2853  AGGCAGAGAATAAATAACCCTGACCAGGGAGGTCCAAGGGAGTAGGCGGA
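
The Start and End columns of Table XXX can be related back to the nucleotide sequence of Table XXIX by a plain substring search. The following is a minimal sketch, assuming that Start is the 1-based position of the first nucleotide of the window and that End equals Start plus the window length (each listed promoter sequence is 50 nucleotides long); the helper name `locate_window` is hypothetical.

```python
from typing import Optional, Tuple


def locate_window(sequence: str, window: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) for the first exact match of window in sequence.

    start is 1-based; end is start + len(window). Returns None if not found.
    """
    index = sequence.upper().find(window.upper())
    if index < 0:
        return None
    start = index + 1
    return start, start + len(window)


# First 120 nucleotides of SEQ ID NO: 2846 (Table XXIX).
five_prime = (
    "TGCGCTCCACCAAGCCTGGCTAACTTTTGCATTTTTAATAGAGGCAGGGTTTCACCATGT"
    "TGGCCTGGCTGGTCTCGAACCCCTGACCTTGCGATCTGCCCACCTCGGCCTCCCAAAGTG"
)
# Promoter sequence from the first row of Table XXX (SEQ ID NO: 2847).
promoter = "TTTTGCATTTTTAATAGAGGCAGGGTTTCACCATGTTGGCCTGGCTGGTC"

print(locate_window(five_prime, promoter))
```

Under the stated convention, the result for this row should agree with the Start and End values listed in the table; the same check could be run for the remaining rows against the full sequence.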
Table XXXI: Alignment of five homologous 5' upstream genomic regulatory regions
of the human 101P3A11 and PSA genes.


Query: 5' upstream regulatory region of the PSA gene.
Subject: Putative 5' upstream regulatory region of the 101P3A11 gene.
Nucleic acid sequences predicted to be binding sites for the indicated
transcription factors are bolded, underlined, or italicized.

1. QUERY (SEQ ID NO: 2854) AND SUBJECT (SEQ ID NO: 2855)

NF-1 SP-1 NF-1
Query: 3864 ccaggctggagtgcagtggcgcagtctcggctcactgcaacctctgcctcccaggttcaa 3923
            ||||||||||||||||||||    ||||||||||||||||||||  |||  | || ||||
Sbjct: 3598 ccaggctggagtgcagtggcatgatctcggctcactgcaacctccaccttgcgggctcaa 3539

Query: 3924 gtgattctcctgcctcagcctcctgagttgctgggattacaggcatgcagcaccatgccc 3983
            | | |||||||||||||||||||  | | ||||||| |||||||| ||  || ||  |||
Sbjct: 3538 gcgtttctcctgcctcagcctcccaactagctgggactacaggcacgcgccatcacaccc 3479

Query: 3984 agctaatttttgtatttttagtagagatgggg 4015
             ||||||| |||||||||||||||||||||||
Sbjct: 3478 ggctaattgttgtatttttagtagagatgggg 3447

2. QUERY (SEQ ID NO: 2856) AND SUBJECT (SEQ ID NO: 2857)

Query: 4670 cctgtaatcccagctactgaggaggctgaggcaggagaatcacttgaacccagaaggcag 4729
            |||||| ||||||||| |  ||||||||||||||||||| | ||||| |||  ||||  |
Sbjct: 3496 cctgtagtcccagctagttgggaggctgaggcaggagaaacgcttgagcccgcaaggtgg 3555

NF-E GR GR
SP1 NF-1 NF-1
Query: 4730 aggttgcaatgagccgagattgcgccactgcactccagcctgggt???agagtgagactc 4789
            |||||||| |||||||||||   |||||||||||||||||| |||   ||||  ||||||
Sbjct: 3556 aggttgcagtgagccgagatcatgccactgcactccagcct-ggtgacagagcaagactc 3614

Query: 4790 tgtctcaaaaaaaaaaaa 4807
            ||| ||||||||||||||
Sbjct: 3615 tgtgtcaaaaaaaaaaaa 3632

3. QUERY (SEQ ID NO: 2858) AND SUBJECT (SEQ ID NO: 2859)

GR NF-1 SP1
Query:  142 tgagactgagtctcgctctgtgcccaggctggagtacagtggtgcaaccttggctcactg 201
            ||| || |||||| |||||||  |||||||||||| ||||||    | || |||||||||
Sbjct: 3621 tgacacagagtcttgctctgtcaccaggctggagtgcagtggcatgatctcggctcactg 3562

Query:  202 caagctccgcctcctgggttcacgccattctcctgcctcagcctcctgagtagctgggac 261
            ||| |||| |||   ||| ||| ||  |||||||||||||||||||  | ||||||||||
Sbjct: 3561 caacctccaccttgcgggctcaagcgtttctcctgcctcagcctcccaactagctgggac 3502

NF-1
Query:  262 tacaggcacccgccaccacgcctggctaannnnnnngtatttttagtagagatgggg 318
            ||||||||| ||||| ||| || ||||||       |||||||||||||||||||||
Sbjct: 3501 tacaggcacgcgccatcacacccggctaa--ttgttgtatttttagtagagatgggg 3447

4. QUERY (SEQ ID NO: 2860) AND SUBJECT (SEQ ID NO: 2861)

Query:  300 atttttagtagagatggggtttcactgtgttagccaggatggtctcagtctcctgacctc 359
            ||||||| |||||   |||||||||  |||| ||| || |||||||   | |||||||| 
Sbjct:   31 atttttaatagaggcagggtttcaccatgttggcctggctggtctcgaacccctgacctt 90

SP1 NF-1
LF-A1 CP2
Query:  360 gtgatct?cccaccttggcctcccaaagtgctgggattacaggcgtgagccactgcgcct 419
            | ||||| ||||||| |||||||||||||||||||||||||||||||||||||||  |||
Sbjct:   91 gcgatctgcccacctcggcctcccaaagtgctgggattacaggcgtgagccactgtacct 150

NF-1
Query:  420 ggc 422
            |||
Sbjct:  151 ggc 153

5. QUERY (SEQ ID NO: 2862) AND SUBJECT (SEQ ID NO: 2863)

NF-1
NF-1 CP2
Query: 4506 gccaggcacagtggctcacgcctgtaatcccaacaccatgggaggctgagatgggtggat 4565
            |||||| ||||||||||||||||||||||||| |||  |||||||| ||| ||||  |||
Sbjct:  153 gccaggtacagtggctcacgcctgtaatcccagcactttgggaggccgaggtgggcagat 94

Query: 4566 cacgaggtcaggagtttgagaccagcctgaccaacatggtgaaactctgtctcta 4620
            | | |||||||| ||| |||||||||| | ||||||||||||||| ||| |||||
Sbjct:   93 cgcaaggtcaggggttcgagaccagccaggccaacatggtgaaaccctgcctcta 39
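
The line of vertical bars between each Query and Sbjct row marks identical positions and can be regenerated from the two aligned strings. The following is a minimal sketch, assuming that gaps (`-`), ambiguous bases (`n`) and illegible characters (`?`) never count as matches; the function names `match_line` and `identity` are hypothetical, and the example strings are the third block of alignment 1.

```python
from typing import Tuple


def match_line(query: str, subject: str) -> str:
    """Return '|' where the aligned characters are identical, ' ' elsewhere."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    unmatched = set("-n?")  # gap, ambiguous base, illegible character
    return "".join(
        "|" if q == s and q not in unmatched else " "
        for q, s in zip(query.lower(), subject.lower())
    )


def identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment block."""
    bars = match_line(query, subject)
    return bars.count("|"), len(bars)


# Third block of alignment 1 (Query 3984-4015, Sbjct 3478-3447).
query = "agctaatttttgtatttttagtagagatgggg"
sbjct = "ggctaattgttgtatttttagtagagatgggg"

print(query)
print(match_line(query, sbjct))
print(sbjct)
print(identity(query, sbjct))
```

Printing the three strings one above the other reproduces the layout used in the table, and the identity count gives a quick per-block measure of similarity.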
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